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Description 

This invention is in the field of genetic engineering. More particularly, it relates to recombinant viral 
proteins associated with lymphadenopathy syndrome and/or acquired immune deficiency syndrome. 

5 With the discovery of human T-cell lymphotropic Virus-I (HTLV-I) as an infectious agent in humans, it 
was established that retroviruses could infect humans and could be the etiological agent of disease. After 
HTLV-I was established, a second retrovirus of the same family, HTLV-II was found in a hairy cell leukemia 
established strain. Since that time, other human retroviruses have been isolated which are associated with 
lymphadenopathy syndrome (LAS) and/or acquired immune deficiency syndrome (AIDS) victims. Various 

10 retroviruses have been isolated from individuals with AIDS (sometimes called HTLV-III) or LAS (sometimes 
called LAV). See for example, Barre-Sinoussi, et al, Science (1983) 220:868-871 and Montagnier, et al, Cold 
Spring Harbor Symposium (1984) in press; Vllmer, et al, Lancet (1984) 1:753, Popovic, et al, Science - 
(1984) 224:497 and Gallo, et al, Science (1984) 224:500. A comparison of HTLV-III and LAV may be found 
in Feorino, et al, (1984), supra. See also, Kjatzman, et al, Science (1984) 225:59-62, Montagnier, et al, ibid 

75 (1984) 63-66, and the references cited therein for a survey of the field. A~general discussion of the T-cell 
leukemia viruses may be found in Marx, Science (1984) 22:475-477. Levy, et al, Science (1984) 225:840- 
842 report the isolation of ARV (AIDS-associated retroviruses). — — 

At the time of filing this application, these viruses (HTLV-III, LAV, and ARV) were generically referred to 
as human T-cell lymphotropic retrovirus (hTLR). However, from 1986 onwards the equivalent generic term 

20 "human immundeficiency virus" (HIV) was adopted as the recognised term for such viruses. Subsequently, 
a sub-division of the generic group HIV was necessary into HIV-I and HIV-II. Since the application relates to 
ARV-2 isolates which are HIV-I isolates, HIV-I has been substituted throughout the application for hTLR 
(HIV) and the claims are accordingly limited to HIV-I. The HIVs (hTLRS) may be shown to be of the same 
class by being similar in their morphology, serology, reverse transcriptase optima and cytopathology, as 

25 identified in the above references. For example, the reverse transcriptase prefers Mg +2 , and has a pH 
optima of about 7.8. 

DNA clones containing HIV sequences are disclosed in EP-A1 -01 73529, EP-A1 -01 78978, EP-A2- 
01 85444 and WO 86/02383. 

The present invention provides the following: 
30 A recombinant DNA construct useful for the expression of a recombinant polypeptide in a cell 
containing the construct, the construct comprising control sequences which regulate transcription and 
translation of the recombinant polypeptide in the cell and a coding sequence regulated by the control 
sequences, wherein the coding sequence comprises a DNA sequence of at least about 21 bp in reading 
frame characterised in that the DNA sequence encodes an antigenic HIV-I amino acid sequence of Figure 2 
35 which sequence is immunologically non-cross-reactive with HTLV-I and HTLV-II and is reactive with HIV-I. 

A cell comprising a recombinant DNA construct described above wherein the cell expresses the 
antigenic HIV-I amino acid sequence and is free from other cells which do not express the antigenic HIV-I 
amino acid sequence. 

A method of producing a recombinant polypeptide comprising an antigenic HIV-I amino acid sequence 
40 wherein a population of the cells described above is cultured under conditions whereby the recombinant 
polypeptide is expressed. 

An immunoassay for detecting antibodies to HIV-I in a sample suspected of containing the antibodies, 
characterised in that at least one recombinant polypeptide is used to bind the antibodies and the 
recombinant polypeptide comprises an antigenic env, gag or pol HIV-I amino acid sequence contained in 
45 the sequence shown in Figure 2 which is immunologically non-cross-reactive with HTLV-I and HTLV-II. 

A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant 
DNA construct the construct comprising a DNA sequence which encodes an amino acid sequence from an 
env, gag or pol polypeptide of HIV-I. 

An article of manufacture for use in an immunoassay for HIV-I antibodies characterised in that it 
so comprises a solid support having bound thereto a recombinant polypeptide described above. 

Figure 1 is a restriction map of proviral DNA (ARV-2). 

Figure 2 is the nucleotide sequence of ARV-2(9B). The amino acid sequences for the products of the 
gag, pol, and env genes are indicated. The U3, R, and U5 regions of the LTRs are also designated.. The cap 
site is position + 1 . A 3 bp inverted repeat at the ends of the LTR, the TATA box at position -29, the 
55 sequence complementary to the 3'-end of the tRNA 1 * 5 at position 183, and the polyadenylation signal at 
position 9174 are underlined. The overiines indicate the amino sequences determined from virion proteins. 
The nucleotides at the beginning of each line are numbered, and the amino acids at the end of each line 
are indicated. 
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Figure 3 is a flow diagram showing the procedures for making the plasmid pGAG25-10. 

Figure 4 is the nucleotide sequence of the p25 gag gene cloned in plasmid pGAG25-10 and the amino 
acid sequence encoded by that gene. 

Figure 5 is the coding strand of the nucleotide sequence cloned in pGAG41-10 for producing the fusion 
5 protein p41 gag and the corresponding amino acid. 

Figure 6 is a nucleotide sequence coding for ARV-2 p16 gag protein that was cloned into plasmid ptacS 
to make an expression plasmid for producing p16 gag protein in bacteria. 

Figure 7 is a nucleotide sequence that encodes ARV-2 env protein that was used to prepare plasmid 
pDPC303. 

10 Figure 8 is a nucleotide sequence that encodes ARV-2 p31 protein and is contained in plasmid pTP31 . 
The HIV-I DNA sequences, either isolated and cloned from proviral DNA or cDNA or synthesized, may 
be used for expression of polypeptides which may be a precursor protein subject to further manipulation by 
cleavage, or a complete mature protein or fragment thereof. The smallest sequence of interest, so as to 
provide a sequence encoding an amino acid sequence capable of specific binding to a receptor, e.g., an 

75 immunoglobulin, will be 21 bp, usually at least 45 bp, exclusive of the initiation codon. The sequence may 
code for any greater portion of or the complete polypeptide, or may include flanking regions of a precursor 
polypeptide, so as to include portions of sequences or entire sequences coding for two or more different 
mature polypeptides. The sequence will usually be less than about 5 kbp, more usually less than about 3 
kbp. 

20 Sequences of particular interest having open reading frames (Figure 2) define the structural genes for 
the gag proteins (p16 and p25), the env protein, and the pol protein (p31). It is to be understood that the 
above sequences may be spliced to other sequences present in the retrovirus, so that the 5'-end of the 
sequence may not code for the N-terminal amino acid of the expression product. The splice site may be at 
the 5'-terminus of the open reading frame or internal to the open reading frame. The initiation codon for the 

25 protein may not be the first codon for methionine, but may be the second or third methionine, so that 
employing the entire sequence indicated above may result in an extended protein. However, for the gag 
and env genes there will be proteolytic processing in mammalian cells, which processing may include the 
removal of extra amino acids. 

In isolating the different domains the provirus may be digested with restriction endonucleases, the 

30 fragments electrophoresed and fragments having the proper size and duplexing with a probe, when 
available, are isolated, cloned in a cloning vector, and excised from the vector. The fragments may then be 
manipulated for expression. Superfluous nucleotides may be removed from one or both termini using Bal31 
digestion. By restriction mapping convenient restriction sites may be located external or internal to the 
coding region. Primer repair or in vitro mutagenesis may be employed for defining a terminus, for 

35 insertions, deletion, point or multiple" mutations, or the like, where codons may be changed, either cryptic or 
changing the amino acid, restriction sites introduced or removed, or the like. Where the gene has been 
truncated, the lost nucleotides may be replaced using an adaptor. Adaptors are particularly useful for joining 
coding regions to ensure the proper reading frame. 

The env domain of the HIV-I genome can be obtained by digestion of the provirus with EcoRI and Kpnl 

40 and purification of a 3300 base pair (bp) fragment, which fragment contains about 400 bp 6T5 7 non-coding 
and about 200 bp of 3' non-coding region. Three different methionines coded for by the sequence in the 5' 
end of the open reading frame may serve as translational initiation sites. 

Digestion of proviral sequences with Sacl and EcoRV provides a fragment of about 2300 bp which 
contains the gag domain and a second smliTbpen reldTng frame towards the 3' end of the gag region. The 

45 gag domain is about 1500 bp and codes for a large precursor protein which is processed to yield proteins 
of about 25,000 (p25), 16,000 (p16) and 12,000 (p12) daltons. Digestion with Sacl and Bglll may also be 
used to obtain exclusively the gag domain with p12, p25 and partial p16 regions" 

Digestion of the previous with Kpnl and Sstl provides a fragment containing the portion of the pol 
domain that encodes p31. — 

so The polypeptides which are expressed by the above DNA sequences may find use in a variety of ways. 
The polypeptides or immunologically active fragments thereof, may find use as diagnostic reagents, being 
used in labeled or unlabeled form or immobilized (i.e., bound to a solid surface), as vaccines, in the 
production of monoclonal antibodies, e.g., inhibiting antibodies, or the like. 

The DNA sequences may be joined with other sequences, such as viruses, e.g., vaccinia virus or 

55 adenovirus, to be used for vaccination. Particularly, the DNA sequence of the viral antigen may be inserted 
into the vaccinia virus at a site where it can be expressed, so as to provide an antigen of HIV-I recognized 
as an immunogen by the host. The gag, pol, or env genes or fragments thereof that encode immunogens 
can be used. 
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Another alternative is to join the gag, env, or pol regions or portions thereof to HBsAg gene or pre-S 
HBsAg gene or immunogenic portions thereof, which portion is capable of forming particles in a unicellular 
microorganism host, e.g., yeast or mammalian cells. Thus, particles are formed which will present the HIV-I 
immunogen to the host in immunogenic form, when the host is vaccinated with assembled particles. 

5 As vaccines, the various forms of the immunogen can be administered in a variety of ways, orally, 
parenteral^, intravenously, intra-arterially, subcutaneously, or the like. Usually, these will be provided in a 
physiologically acceptable vehicle, generally distilled water, phosphate-buffered saline, physiological saline, 
or the like. Various adjuvants may be included, such as aluminum hydroxide, and the dosages, number of 
times of administration and manner of administration determined empirically. 

10 In order to obtain the HIV-I sequence, virus can be pelleted from the supernatant of an infected host 
cell. A 9 kb RNA species is purified by electrophoresis of the viral RNA in low-melting agarose gels, 
followed by phenol extraction. The purified RNA may then be used as a template with random primers in a 
reverse transcriptase reaction. The resulting cDNA is then screened for hybridization to polyA+ RNA from 
infected and uninfected cells. Hybridization occurring from infected, but not uninfected cells, is related to 

75 the HIV-I. 

Genomic DNA from infected cells can be restriction enzyme digested and used to prepare a 
bacteriophage library. Based upon restriction analysis of the previously obtained fragments of the retrovirus, 
the viral genome can be partially digested with EcoRI and 9 kb-15 kb DNA fragments isolated and 
employed to prepare the library. The resulting recombinant phage may be screened using a double-lift 

20 screening method employing the viral cDNA probe, followed by further purification, e.g., plaque-purification 
and propagation in large liquid cultures. From the library, the complete sequence of the virus can be 
obtained and detected with the previously described probe. 

HIV-I DNA (either provirus or cDNA) may be cloned in any convenient vector. Constructs can be 
prepared, either circular or linear, where the HIV-I DNA, either the entire hTLR or fragments thereof, may be 

25 ligated to a replication system functional in a microorganism host, either prokaryotic or eukaryotic cells 
(mammalian, yeast, arthropod, plant). Micro-organism hosts include E. coli, B. subtilis, P. aerugenosa, S. 
cerevisiae , N. crassa , etc. Replication systems may be derived from ColE1 , 2 mu plasm id, X, SV40, bovine 
papilloma virus, or the like, that is, both plasmids and viruses. Besides the replication system and the HIV-I 
DNA, the construct will usually also include one or more markers, which allow for selection of transformed 

30 or transfected hosts. Markers may include biocide resistance, e.g., resistance to antibiotics, heavy metals, 
etc., complementation in an auxotrophic host to provide prototrophy, and the like. 

For expression, expression vectors will be employed. For expression in microorganisms, the expression 
vector may differ from the cloning vector in having transcriptional and translational initiation and termination 
regulatory signal sequences and may or may not include a replication system which is functional in the. 

35 expression host. The coding sequence is inserted between the initiation and termination regulatory signals 
so as to be under their regulatory control. Expression vectors may also include the use of regulatable 
promoters, e.g., temperature-sensitive or inducible by chemicals, or genes which will allow for integration 
and amplification of the vector and HIV-I DNA such as tk, dhfr, metallothionein, or the like. 

The expression vector is introduced into an appropriate host where the regulatory signals are functional 

40 in such host. The expression host is grown in an appropriate nutrient medium, whereby the desired 
polypeptide is produced and isolated from cells or from the medium when the polypeptide is secreted. 

Where a host is employed in which the HIV-I transcriptional and translational regulatory signals are 
functional, then the HIV-I DNA sequence may be manipulated to provide for expression of the desired 
polypeptide in proper juxtaposition to the regulatory signals. 

45 The polypeptide products can be obtained in substantially pure form, particularly free of debris from 
human cells, which debris may include such contaminants as proteins, polysaccharides, lipids, nucleic 
acids, viruses, bacteria, fungi, etc., and combinations thereof. Generally, the polypeptide products will have 
less than about 0.1, usually less than about 0.01 weight percent, of contaminating materials from the 
expression host. Depending upon whether the desired polypeptide is produced in the cytoplasm or 

50 secreted, the manner of isolation will vary. Where the product is in the cytoplasm, the cells are harvested, 
lysed, the product extracted and purified, using solvent extraction, chromatography, gel exclusion, elec- 
trophoresis, or the like. Where secreted, the desired product will be extracted from the nutrient medium and 
purified in accordance with the methods described above. 

The expression products of the env, gag, and pol genes and immunogenic fragments thereof having 

55 immunogenic sites may be used for screening antisera from patients 1 blood to determine whether 
antibodies are present which bind to HIV-I antigens. One or more of the recombinant antigens are used in 
the serological assay. Preferred modes of the assay employ a combination of gag, env, and pol antigens. A 
combination of p25, P31 and env recombinant antigens is particularly preferred. A wide variety of 
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immunoassay techniques can be employed, involving labeled or unlabeled antigens or immobilized 
antigens. The label may be fluorescers, radionuclides, enzymes, chemiluminescers, magnetic particles, 
enzyme substrates, cofactors or inhibitors, ligands, or the like. 

A particularly convenient technique is to bind the antigen to a support such as the surface of an assay 
5 tube or well of an assay plate or a strip of material, such as nitrocellulose or nylon, that binds proteins and 
contact the sample with the immobilized antigen. After washing the support to remove non-specifically 
bound antisera, labeled antibodies to human Ig are added. The support is then washed again to remove 
unbound labeled anti-human Ig. The presence of bound analyte is then determined through detection of the 
label. 

70 ELISA and "dot-blot" assays are particularly useful for screening blood or serum samples for anti-HIV-l 
antibodies. The ELISA assay uses microtiter trays having wells that have been coated with the antigenic 
HIV-I polypeptides(s). The wells are also typically post-coated with a non-antigenic protein to avoid non- 
specific binding of antibodies in the sample to the well surface. The sample is deposited in the wells and 
incubated therein for a suitable period under conditions favorable to antigen-antibody binding. 

75 Anti-HIV-l antibodies present in the sample will bind to the antigen(s) on the well wall. The sample is 
then removed and the wells are washed to remove any residual, unbound sample. A reagent containing 
enzyme labeled antibodies to human immunoglobulin is then deposited in the wells and incubated therein to 
permit binding between the labeled anti-human Ig antibodies and HIV-I antigen-human antibody complexes 
bound to the well wall. Upon completion of the incubation, the reagent is removed and the wells washed to 

20 remove unbound labeled reagent. A substyate reagent is then added to the wells and incubated therein. 
Enzymatic activity on the substrate is determined visually or spectrophotometrically and is an indication of 
the presence and amount of anti-HIV-l antibody-containing immune complex bound to the well surface. 

The w dot-blot" procedure involves using hTLR antigen(s) immobilized on a piece or strip of bibulous 
support material, such as nitrocellulose filter paper or nylon membrane, rather than antigen-coated 

25 microtiter trays. The support will also be treated subsequently with a non-antigenic protein to eliminate non- 
specific binding of antibody to the support. The antigen-carrying support is dipped into the sample and 
allowed to incubate therein. Again, any anti-HIV-l antibodies in the sample will bind to the antigen(s) 
immobilized on the support. After a suitable incubation period the support is withdrawn from the sample and 
dipped repeatedly in wash buffer to remove any unbound sample from the paper. The support is then 

30 dipped into the enzyme-labeled antibody to human Ig reagent for a suitable incubation period. Following 
treatment with the labeled reagent the support is dipped in wash buffer, followed by incubation in the 
substrate solution. Enzymatic activity, indicating the presence of anti-HIV-l antibody-containing complexes 
on the support, causes color changes on the support which may be detected optically. 

Either of these techniques may be modified to employ labels other than enzymes. The reading or 

35 detection phases will be altered accordingly. 

The antigenic polypeptides of HIV-I may also be used as immunogens by themselves or joined to other 
polypeptides for the production of antisera or monoclonal antibodies which may be used for therapy or 
diagnosis. The immunoglobulins may be from any mammalian source, e.g., rodent, such as rat or mouse, 
primate, such as baboon, monkey or human, or the like. For diagnosis, the antibodies can be used in 

40 conventional ways to detect HIV-I in a clinical sample. 

The HIV-I DNA sequences may also be labeled with isotopic or non-isotopic labels or markers and be 
used as DNA probes to detect the presence of native HIV-I nucleotide sequences in samples suspected of 
containing same. 

The following examples are offered by way of illustration and not by way of limitation. 

45 

1 . AIDS related virus-2 (ARV-2) purification and preparation of viral RNA . 

HUT-78 cells infected with ARV-2 (ATCC Accession No. CRL 8597, deposited on August 7, 1984) were 
obtained from Dr. Jay Levy, University of California, San Francisco. Cultures were grown for two weeks in 

so RPMI medium with 10% fetal calf serum. Cultures were centrifuged at 2 Krpm for 1 hr at 4*C using a SW- 
28 rotor. The pellet, containing the virus, was resuspended in 10 mM Tris-HCI, pH 7.5 on ice. The 
resuspended pellet was treated with 10 ug of DNase (Boehringer-Mannhein) and was layered onto a linear 
sucrose gradient (15-50% in 10 mM Tris-HCI, pH 7.5, 1 mM EDTA. 20 mM NaCI). The gradient was spun at 
34 Krpm for 4 hr at 4*C, in SW-41 rotor. Five 2.5 ml fractions were collected and an aliquot of each was 

55 electrophoresed in a 1% agarose, 5 mM methyl mercury hydroxide gel (Bailey and Davidson, Anal Biochem 
(1976) 70:75-85) to determine which contained the 9 kb viral RNA. The fraction containing the viral RNA 
was diluted to 10 ml in 10 mM Tris-HCI, pH 7.5, 1 mM EDTA and was centrifuged at 34 Krpm for 2 hr at 
4°C. The pellet was resuspended in 20 mM Tris-HCI, pH 7.6, 10 mM EDTA, 0.1% SDS, and 200 ug/ml 
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proteinase K. Incubation was carried out for 15 min at room temperature. The mixture was extracted with 
phenol and the aqueous phase was made 400 mM NaCI and precipitated with ethanol. The pellet was 
resusperided in water and stored at -70 • C. 

To purify the viral RNA from the nucleic acid pellet obtained as described above, a sample was 

5 electrophoresed in a low-melting 1% agarose gel containing 5 mM Methyl mercury hydroxide. After 
electrophoresis, the gel was stained with 0.1% ethidium bromide and nucleic acid bands were visualized 
under UV light. The region corresponding to 9 kb was cut from the gel and the agarose was melted at 70 • C 
for 2 to 3 min in three volumes of 0.3 M NaCI, 10 mM Tris, pH 7.5, 1 mM EDTA. The mixture was extracted 
with an equal volume of phenol. The aqueous phase was reextracted with phenol and was precipitated with 

70 ethanol. The pellet was washed with cold 95% ethanol, air dried, resuspended in water and stored at -70 °C 
until use. One hundred ml of culture medium yielded 0.5 to 1 ug of purified RNA. 

2. Synthesis of labeled homologous viral probe . 

75 A 32 P-labeled cDNA was made to the gel purified viral RNA using random primers (calf thymus primers) 
prepared as described in Maniatis, et al, A Laboratory Manual , Cold spring Harbor, NY, 1982. The reaction 
mixture contained 2 ul of 0.5 M MgCfe; 5 til of 0.1 M dithiothreitol; 2.5 ul each of 10 mM dATP, 10 mM 
dGTP and 10 mM dTTP; 2.5 ul calf thymus primer (100A2 G o/ml); 0.5 ug viral RNA; 5 ul of actinomycin D 
(200 ug/ml); 10 ul of 32 P-dCTP (> 3000 Ci/mmole, 1 mCi/ml) and 1 ul of AMV reverse transcriptase (17 

20 units/ul) in a 50 ul reaction volume. The reaction was incubated for 1 hr at 37 • C. The probe was purified 
away from free nucleotides by gel filtration using a Sephadex® G50 column. The void volume was pooled, 
NaCI was added to a final concentration of 400 mM and carrier single-stranded DNA to 100 ug/ml, and the 
cDNA was precipitated with ethanol. The pellet was resuspended in water and incorporated 32 P counts were 
determined. 

25 

3. Detection of ARV sequences in polyA+ RNA prepared from infected HUT-78 cells . 

PolyA+ RNA was prepared from HUT-78 cells infected with ARV-2, ARV-3 or ARV-4 (three different 
isolates from three different AIDS patients) and from uninfected HUT-78 cells. The polyA+ RNA was 
electrophoresed on 1% agarose gels containing 5 mM methyl mercury hydroxide (Bailey and Davidson, 
supra), was transferred to nitrocellulose filters, and hybridized with the homologous probe prepared as 
described in Section 2. Hybridizations were carried out in 50% formamide, 3 x SSC at 42 • C. Washes were 
at 50 °C in 0.2 x SSC. A 9 kbp band was present in all three samples of infected HUT-78 cells. This band 
was absent in polyA+ from uninfected cells. 

4. Detection of ARV sequences in infected and non-infected HUT-78 cells. 

High molecular weight DNA (chromosomal) was prepared from cultures of HUT-78 cells infected with 
ARV-2 and from non-infected HUT-78 cells following the procedure of Luciw, et al, Molec and Cell Biol - 
40 (1984) 4:1260-1269. The DNA was digested with restriction enzyme(s), electrophoresed in 1 % agarose gels 
and blotted onto nitrocellulose following the procedure described by Southern, (1975), supra. Blots were 
hybridized with the 32 P-labeled probe (10 6 cpm/blot) in a mixture containing 50% formamide, 3 x SSC, 10 
mM Hepes, pH 7.0, 100 ug/ml denatured carrier DNA, 100 ug/ml yeast RNA and 1 x Denhardt's for 36 hr at 
42 *C. Filters were washed once at room temperature in 2 x SSC and twice at 42 ° C in 0.2 x SSC, 0.1% 
45 SDS. Filters were air dried and exposed to X-Omat film using an intensifying screen. 

The homologous ^P-probe to ARV-2 hybridized specifically to two bands in the DNA from infected 
cells restricted with Sacl. These bands were absent when DNA of non-infected cells was used, indicating 
that the probe is hybridizing specifically to infected cells presumably to the provirus integrated in the 
chromosomal DNA. The molecular weight of the bands is approximately 5 kb and 3 kb. 
so In order to determine if different enzymes would cut the proviral sequence, several other restriction 
digestions of the cell DNA were carried out using EcoRI, Sphl or Kpnl or double digestions using two of 
them. Southern results show specific bands hybridizing when DNA of infected cells is used. Figure 1 shows 
a schematic map of the positions of restriction enzyme sites in the proviral sequence, and indicates 
fragment sites. 

55 



30 
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5. Cloning of proviral ARV-2 DNA . 

High molecular weight cell DNA from infected HUT-78 cells was prepared following the procedure of 
Luciw, et al, supra. The DNA was digested with EcoRI, which cuts once in the provirus, centrifuged in a 

s sucrose gradient and fractions corresponding to 8-TsTkb were pooled, dialyzed and concentrated by ethanol 
precipitation. The bacteriophage X derivative cloning vector, EMBL-4 (Karn, et al. Methods Enzymol (1983) 
101:3-19) was digested to completion with a mixture of EcoRI, BamHI and Sail restriction enzymes and the 
DNA then deproteinized by phenol-chloroform extractionTprecipitated with cold ethanol and resuspended in 
ligation buffer. The EMBL-4 phage "DNA and EcoRI digest of cellular DNA were mixed and ligated and the 

w resultant recombinant phage genomes packaged in vitro. After phage infection of X-sensitive E. coli - 
(DPSOsupF), about 500,000 phage plaques were transferred onto nitrocellulose filters, DNA was fixedTnd 
the filters were screened with a homologous ^P-probe prepared as described in Section 2. Eleven 
recombinant phage out of 500,000 phage annealed in the initial double-lift screening method (Maniatis, et al, 
Molecular Cloning, A Laboratory Manual , NY, 1982) to viral cDNA probe, and these were further plaque- 

75 purified and propagated in large liquid cultures for preparation of recombinant DNA. Plaque-purified phage 
containing ARV DNA were propagated in liquid culture in E. coli DP50supF: phage particles were harvested 
and banded in CsCI gradients and recombinant phage DNA was prepared by phenol extraction followed by 
ethanol precipitation (Maniatis, et al, supra). One uq of purified phage DNA was digested with restriction 
enzymes, electrophoresed on 1% agarose gels, and visualized with ethidium bromide under ultraviolet light. 

20 The DNA from these gels was transferred to nitrocellulose and annealed with viral cDNA probe. 

One of the 11 phage, designated X ARV-2(9B), was deposited at the ATCC on 25 January 1985 and 
given Accession No. 40158. X ARV-2(9B) contained an insertion of full-length proviral DNA along with 
flanking cell sequences. Digestion of X ARV-2(9B) DNA with Sacl yielded viral DNA fragments of 3.8 kb and 
5.7 kb. EcoRI digestion of X ARV-2(9B) produced virus containing DNA species at 6.4 kb and 8.0 kb; a 

25 double digest of Sacl and EcoRI gave viral DNA fragments at 3.8 kb and 5.4 kb. This pattern is consistent 
with that of a provirus linked to cell DNA. 

In addition to X ARV-2(9B), phage was obtained that (1) possessed the left half of the viral genome from 
the EcoRI site in viral DNA extending into flanking cell DNA (XARV-2(8A)) and (2) phage that had the right 
half of the viral genome (XARV-2(7D)) from the EcoRI site in viral DNA extending into flanking cell DNA. 

30 Bacteriophages XARV-2(7D) (right) and XARV-2(8A) (left) were deposited at the ATCC on October 26. 1984 
and given Accession Nos. 40143 and 40144, respectively. 

6. Polymorphism . 

35 To measure the relatedness of independent ARV isolates, restriction enzyme digests of DNA from HUT- 
78 cells infected with ARV-3 and ARV-4 were analyzed with the probe made from cloned ARV-2 DNA. The 
Sac l digest of ARV-3 DNA was similar to that of ARV-2 whereas the Hindlll digests displayed different 
patterns. The Sacl digest and the Pstl digest of ARV-4 DNA differed from the corresponding digests of 
ARV-2 DNA. The intensity of the annealing signals obtained with ARV-3 and ARV-4 samples was much 

40 lower (about 10-fold less) than that for ARV-2 DNA probably as a result of the fact that fewer cells were 
infected in the ARV-3 and ARV-4 cultures. The viral-specific DNA fragments produced by Sacl treatment of 
ARV-3 and ARV-4 DNA totaled 9.0-9.5 kbp, a value similar to that of ARV-2 and in consonance with the 
RNA genome sizes. 

45 7. Sequencing of proviral DNA . 

Fragments or subfragments of ARV-2 DNA from X phage 9B were prepared and cloned into M13 
according to conventional procedures (Maniatis, et al, supra). Sequencing was performed according to 
Sanger, et al, Proc Natl Acad Sci USA (1977) 74:5463, using the universal M13 primer or chemically 
so synthesized primers complementary to ARV-2 sequence. The sequence is shown in Figure 2. 

8. Amino acid sequence analysis of p25 and p16 gag coded proteins . 

ARV-2 was prepared and purified as described in Section 1 . The viral proteins were electrophoresed on 
55 an acrylamide gel, and the band corresponding to a 24,000 daiton or 16,000 dalton protein was excised 
from the gel and used for sequencing. Micro-sequence analysis was performed using Applied Biosystems 
model 470A protein sequencer similar to that described by Hewick, et al, J Biol Chem (1981) 256:7990- 
7997. Phenylthiohydantoin amino acids were identified by HPLC using a Beckman ultrasphere OD"S~column 
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and a trifluoroacetic acid-acetonitrile buffer system as reported by Hawke, et al, Anal Biochem (1982) 
120:302-311. Table 1 shows the first 20 amino acids from the amino terminus determined for p25-gag 
protein and Table 2 shows the first 30 amino acids for p16-gag protein. 

5 TABLE 1 



Amino-terminal sequence of p25-gag 


Position 


Amino acid 


1 


Pro 


2 


He 


3 


Val 


4 


Gin 


5 


Asn 


6 


Leu 


7 


Gin 


8 


. Gly 


9 


Gin 


10 


Met 


11 


Val 


12 


(His) 


13 


Gin 


14 


Ala 


15 


lie 


16 


(Ser) 


17 


Pro 


18 


(Arg, Lys) 


19 


Thr 


20 


(Leu) 
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TABLE 2 



Amino-terminal sequence of p16-gag 


rusiuon 


Ammo acia 


1 


(Met) 


2 


Gin 


3 


Arg 


4 


Gly 


5 


Asn 


6 


Phe 


7 


Arg 


8 


Asn 


9 


Gin 


10 


Arg 


11 


Lys. 


12 


Thr 


13 


Val 


14 


Lys 


15 


— (Cys) 


16 


Phe 


17 


Asn 


18 


™(Cys) 


19 


Gly 


20 


Lys 


21 


Glu 


22 


Gly 


23 


(His) 


24 


lie 


25 


Ala 


26 


(Lys) 


27 


Asn 


28 


(Gly) 


29 


(Arg) 


30 


(Ala, Leu) 



The. amino acid sequence of Table 1 is predicted from the ARV-2 DNA sequence of Figure 2. 
Therefore, these results confirm that the indicated gag open reading frame is in fact being translated and 
40 identifies the N-termini of p25 and p16. 

9. Expression of p25 gag protein of ARV-2 in bacteria . 

A. Host-vector system 

45 ~~ — — 

The p25 gag protein is synthesized by E. coli strain D1210 transformed with plasmid pGAG25-10. 

Plasmid pGAG25-10 is a pBR322 derivative which contains the sequence coding for p25 gag under 
transcriptional control of a hybrid tac promoter (De Boer et al, PNAS (1983), 80:21-25) derived from 
sequences of the trp and the lac UV5 promoters. Expression of p25 gag is" induced in bacterial 
50 transformants with isopropylthiogalactoside (IPTG). 

E. coli D1210, a lac-repressor overproducing strain, carries the lacl q and lacY + alleles on the 
chromosome but otherwise is identical to E. coli HB101 (F~ lacl + , lacO + , lacZ + , lacY", gal", pro", leu", thi", 
end", hsm", hsr", recA", rpsL") from which it was derived. 

55 B. Construction of pGAG25-10 . 

Plasmid pGAG25-10 was constructed by cloning a 699 bp DNA fragment coding for p25 gag into 
plasmid ptacS, according to the scheme shown in Figure 3. The vector ptac5 is a pBR322 derivative which 
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contains the tac promoter. Shine Delgarno sequences, and a polylinker as a substitution of the original 

pBR322 sequences comprised between the EcoRI and Pvull restriction sites. 

The 699 bp DNA fragment codes for the complete p25 gag protein (amino acid residues 139 to 369 as 

numbered in Figure 2), the only difference being that a methionine was added as the first amino acid in 
5 pGAG25-10 to allow for translational initiation. This change, as well as other changes in nucleotide 

sequence as indicated below, was achieved by using chemical synthesis of party of the DNA fragment. The 

DNA fragment also includes two stop codons at the 3* end of the sequence. 

Figure 4 shows the nucleotide sequence cloned in pGAG25-10 and the amino acid sequence derived 

from it. DNA sequences that are not underlined in the figure were derived directly from the ARV-2(9B) 
10 cDNA. All other sequences were chemically synthesized or derived from vector ptac5. Changes were 

introduced in this DNA sequence, with respect to the original cDNA, to create or delete restriction sites, to 

add a methionine prior to the proline (first residue of p25) or to include stop codons after the last codon of 

p25 gag. However, as previously indicated, all changes in the DNA sequence, except those in the first 

codon, do not alter the amino acid sequence of p25 gag. 

75 

C. Preparation of D1210 (pGAG25-10) strain and characterization of p25 gag protein expressed by 
transformants . 

E. coli D1210 cells are made competent for transformation following a standard protocol (Cohen et al, 
20 PNAS (1972) 69:2110). Transformation is performed as indicated in the protocol with 25-50 ng of pGAG25- 
10. The transformation mix is plated on agar plates made in L-broth containing 100 ug/ml ampicillin. Plates 
are incubated for 12 hr at 37° C. 

Single ampicillin resistant colonies are transferred into 1 ml L-broth containing 100 ug/ml ampicillin and 
grown at 37 "C. Expression of p25 gag protein is induced by adding 10 ul of 100 mM IPTG (Sigma) to a 
25 final concentration of 1 mM followed by incubation at 37 °C for 2 hr. 

Cells from 1 ml of induced cultures are pelleted and resuspended in 100 ul Laemmli sample buffer. 
After 3 cycles of boiling and freezing, portions of resultant lysates are analyzed on standard denaturing 
acrylamide gels. Proteins are visualized by staining with Coomassie blue. 

The extent of expression is initially determined by appearance of new protein bands for induced 
30 candidate samples compared with control. Proteins of molecular weights expected for the genes expressed 
comprised 2%-5% of total cell protein in the highest expressing recombinants as determined by visual 
inspection with reference to a standard protein of known amount. 

Authenticity of the expressed proteins is determined by standard Western transfer of proteins to 
nitrocellulose and analysis with appropriate human or rabbit immune sera or mouse monoclonal antibodies 
35 (see E.4.a. below) or by ELISA assays of soluble E. coli proteins using human immune sera from AIDS 
patients (see E.4.b. below). — 

D. Fermentation process. 

40 D.1 . Preparation of transformant master seed stock . 

Transformant cells from a culture expressing high levels (3%) of p25 gag are streaked onto an L-broth 
plate containing 100 ug/ml ampicillin and the plate is incubated overnight at 37 *C. A single colony is 
inoculated into 10 ml of L-broth, 100 ug/ml ampicillin and grown overnight at 37 *C. An aliquot is used to 
45 verify plasmid structure by restriction mapping with Sail and Pstl. A second aliquot is used to induce 
expression of p25 gag and the rest of the culture is niade 1 5%~~glycerol by adding 1/4 volume of 75% 
sterile glycerol. Glycerol cell stocks are aliquoted in 1 ml and quickly frozen in liquid nitrogen or dry-ice 
ethanol bath. These master seed stocks are stored at -70 • C. 

so D.2. Master plate/single colonies and overnight cultures. 

The master seed stock is scraped with a sterile applicator which is used to streak an L-broth plate 
containing 100 ug/ml ampicillin. Single colonies from this plate are used to inoculate 20-50 ml of L- 
broth/amp, which is incubated at 37 ° C overnight. 

55 
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D.3. Fermentor inoculum. 

An aliquot of the overnight culture is used to inoculate larger volumes (1-6 liters) of L-broth/amp. Cells 
are incubated at 37 # C overnight and reach an O.D. 6 so of approximately 5 prior to use as inoculum for the 
5 fermenter run. 

D. 4. Fermentation and harvest . 

Fermenters (capacity: 16 liters) containing 10 I of L-broth and 1 ml of antifoam are inoculated with 100- 
70 500 ml from the inoculum culture. Cells are grown at 37* C to an O.D. of about 1. Expression of p25 gag is 
induced by addition of 100 ml of an IPTG solution (100 mM) to yield a 1 mM final concentration in. the 
fermenter. Cells are grown for 3 additional hours and subsequently harvested using continuous flow 
centrifugation. At this step cells may be frozen and kept at -20 'C until purification of p25 gag proceeds. 
Alternatively, 250 I fermenters are inoculated with 1-5 I- from the inoculum culture. Growth, induction, and 
75 harvest are as indicated before. 

E. Purification and characterization of p25 gag . 
E.1. Cell breakage . 

20 

Frozen E. coli cells are thawed and suspended in 2.5 volumes of lysis buffer (0.1 M sodium phosphate 
(NaPi). pH 7.5. 1 mM EDTA. 0.1 M NaCI). Cells are broken in a non-continuous system using a 300 ml 
glass unit of a Dyno Mill at 3000 rpm and 140 ml of acid-washed glass beads for 15 min. The jacketed 
chamber is kept cool by a -20 * C ethylene glycol solution. Broken cells are centrif uged at 27.000 x g for 25 
25 minutes to remove debris and glass beads. The supernatant is recovered and kept at 4°C. 

E.2. Selective protein precipitation . 

The cell extract is made 30% (NH^SO* by slowly adding the ammonium sulfate at 4'C. The extract is 
30 stirred for 10. min after the final concentration is achieved, followed by contrifugation at 27.000 x g for 20 
min. The pellet is resuspended in 1 M NaCI, 1 mM EDTA. 1% Triton® X-100, and 5% SDS, and then boiled 
for 5 min. 

E.3. Gel filtration . 

35 

The fraction obtained by selective precipitation is submitted to gel filtration using a G50 Sephadex 
column equilibrated in 0.03 M NaPi, pH 6.8. Chromatography is developed in the same solution. Fractions 
are collected and absorbance at 280 nm is determined. Protein-containing fractions are pooled and 
characterized by protein gel electrophoresis, Western analysis, and ELISA. 

40 

E.4. Characterization of recombinant p25 gag . 

- a. Protein gel electrophoresis. SDS-polyacrylamide gel analysis (10%-20% gradient gels) of proteins 
from pGAG25-containing cells and control cells indicated that varying levels of a protein of a molecular 

45 weight of about 25,000 were specifically induced in cells containing p25 gag expression plasmids after 
derepression of the tacl promoter with IPTG. Identity of the p25 gag gene product was confirmed by 
both an enzyme-linked immunosorbent assay (ELISA, see E.4.c.) and Western immunoblot analysis (see 
E.4.b.) using both AIDS patient serum and a monoclonal antibody to viral p25 gag. 
b. Western analysis . Samples were electrophoresed under denaturing conditions on a 10%-20% 

so polyacrylamide gradient gel. Samples were electroblotted onto nitrocellulose. The nitrocellulose paper 
was washed with a 1:250 dilution of AIDS patient reference serum (EW5111, obtained from P. Feorino, 
Centers for Disease control, Atlanta, Georgia) and then with a 1:500 dilution of HRP-conjugated goat 
antiserum to human immunoglobulin (Cappel, No. 3201-0081). Alternatively, the nitrocellulose was 
washed with undiluted culture supernatant from 76C, a murine monoclonal antibody to ARV-2 p25 gag, 

55 and then with a 1 :500 dilution of HRP-conjugated goat antiserum to mouse immunoglobulin (TAGO, No. 
6450). The substrate for immunoblots was HRP color development reagent containing 4-chloro-1- 
naphthol. 
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The p25 gag protein reacted with both AIDS patient reference serum and with the monoclonal 
antibody, while It shows no reactivity with the non-immune serum. 

c - ELISA - P 25 9 a 9 was purified from bacterial extracts as previously described. The reactivity of sera 
with the purified protein was assayed by coating wells of microtiter plates with 0.25 ug/ml, adding 
dilutions of test sera (positive reference serum EW5111 of human negative serum), followed by a 1:1000 
dilution of HRP-conjugated goat antiserum to human immunoglobulin. p25 gag protein reacted with the 
positive serum with a midpoint of titration curve of approximately 1:800. There was no reactivity with 
serum from a normal individual. 

10. Comparison of recombinant p25 gag protein and natural p25 gag protein in ELISA . 

The reactivity of purified recombinant p25 gag to various sera was compared to that of natural p25 gag 
protein purified by preparative polyacrylamide gel electrophoresis in an ELISA assay. For control, assays 
were also made using disrupted gradient purified virus (5 ug/ml). 

PVC microtiter plates were incubated for 2 hr at 37 9 C with 10 ug/ml (50 ul/well in 0.1 M sodium borate, 
pH 9.0) of the Ig fraction of ascites from murine anti-p25 gag monoclonal antibody 76C. The plates were 
washed with PBS and the wells were filled with 10% normal goat serum in PBS. Following a 30 min 
incubation at room temperature, the plates were washed with normal saline containing 0.05% Triton® X-.100 
(ST) and dilutions of the test ARV protein (50 ul/well in ST with 10% goat serum [STGS]) were added to the 
wells. The plates were incubated for 2 hr at 37 *C, washed with ST, and then incubated for 1 hr at 37 *C 
with 50 ul/well of rabbit antiserum raised against disrupted ARV (1:1000 dilution in STGS). The wells were 
washed, incubated for 1 hr with 50 ul of a 1:1500 dilution in STGS of HRP-conjugated goat antiserum to 
rabbit immunoglobulin, washed, and then the wells received 50 ul/well of substrate solution (150 ug/ml 2,2'- 
azino-di-[3-ethyibenzthiazolene sulfonic acid], 0.001% H2O2, 0.1 M citrate pH 4). The reaction was stopped 
after incubation for 30 min at 37 °C by the addition of 50 ul/well of 10% SDS. The absorbance was read on 
a Flow Titertech ELISA reader at 414 nm. Samples were assayed in duplicate beginning at a dilution of 1:10 
and by serial 2-fold dilutions thereafter. 

The table below summarizes the results of assays on 8 AIDS sera that scored positive in the assay with 
disrupted virus and 6 normal sera that were negative in the disrupted virus assay. 
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SERUM ELISA ASSAY TITER * 
NUMBER 

Disrupted Virus Recomb. p25 gag Viral d25 gag 
Group I: Sera Scoring As Positive in Virus ELISA b 



1 


51,200 


3.125 


3.125 


5 


12,800 


25 


25 


6 


12,800 


625 


625 


7 


12,800 


3.125 


3.125 


8 


25. 6\)0 


15. 625 


15.625 


9 


12,800 


625 


625 


13 


800 


125 


125 


18 


3,200 


625 


625 



GroupII: Sera Scoring Negative in Virus ELISA b 

15 -c 
16 

20 19 
21 
26 
33 



Reported as the reciprocal of the serum dilution that 
gave a signal eguivalent to 50% of the maximum. 
Results were confirmed by immunofluorescence and im- 
munoblotting as described previously. 
No detectable signal at a 1:25 serum dilution. 

These results show that p25 gag purified from bacteria behaves identically to similarly purified p25 gag 
from AIDS virus in an ELISA of the eight AIDS patient sera. The results of the ELISA show that there is a 
35 wide variation in the levels of anti-p25 gag antibodies and suggests that antibodies to some virus-encoded 
proteins may not be detected using conventional virus-based assay systems. 

1 1 . Expression of p41 gag protein of ARV-2 in bacteria . 

40 A fusion protein of the p25 gag and p16 gag proteins of ARV-2, designated p41 gag, was synthesized 
in E. coli . strain D1210 transformed with plasmid pGAG41-10. pGAG41-10 was constructed from plasmid 
pGAG25-10 as shown in Figure 3 by inserting an Sphl-Hpal fragment from the ARV-2 genome containing 
the sequences from the C-terminal p16 gag portiorTof the p53 gag precursor polyprotein and part of the 
p25 gag protein between the Sphl and BamHI sites of pGAG25-10. The coding strand of the DNA sequence 

45 cloned in pGAG41-10 is shown in Figure 5. Transformation and induction of expression were effected by 
the procedures described above. The cells were treated and the p41 gag protein was visualized on 
Coomassie-stained gel as described above. The approximate molecular weight of the observed protein was 
41,000 daltons. The protein reacted with AIDS sera and monoclonal antibody to p25 gag in Western and 
ELISA analyses carried out as above. 

50 

12. Expression of p16 gag protein of ARV-2 in bacteria . 

The sequence shown in Figure 6 and coding for the p16 gag protein was chemically synthesized using 
yeast-preferred codons. The blunt-end Sail fragment (381 bp) was cloned into Pvull-Sall digested and gel- 
55 isolated ptac5 (see 9 and 11 above). The resulting plasmid was used to transform"D1210 cells, as in 9 
above. Expression was induced with IPTG, and proteins were analyzed by polyacrylamide gel elec- 
trophoresis and Western analysis. A band of about 16,000 daltons was induced by IPTG in the transformed 
cells. This protein showed reactivity in Western blots with immune sera from AIDS patients. No reactivity 



25 

a . 
b. 

30 C. 
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was observed with sera from normal individuals. 

A recombinant gag protein was also expressed in Cos (mammalian) cells. 

13. Production of ARV-2 env protein by yeast . 

5 

A. Host-vector system . 

A partial env protein is synthesized by S. cerevisiae 2150-2-3 transformed with plasmid pDPC303. 
Plasmid pDPC303 is a yeast expression vector which contains the sequence coding for 2/3 of the env 
10 protein as well as pBR322 sequences including the amp R gene and 2-micron sequences including the yeast 
leu 2-04 gene. Expression of env is under regulation of the yeast pyruvate kinase promoter and terminator 
sequences. Yeast strain S. cerevisiae 2150-2-3 has the following genotype: Mat a, ade 1, leu 2-112, cir*. 
This strain was obtained from Dr. Leland Hartwell, University of Washington. 

is B. Construction of pDPC303, a yeast expression vector for env protein . 

Plasmid pDPC303 contains an "expression cassette" (described below) for env cloned into the BamHI 
site of vector pCI/l. Vector pCI/l contains pBR322 and 2 micron sequences including the amp R and yeast leu 
2-04 markers. It was derived from pJDB219d (Beggs, Nature (1978), 275: 104) by replacing the pMB9 
20 region with pBR322 sequences. 

The "expression cassette" for env consists of the following sequences fused together in this order (5* to 
3'): yeast pyruvate kinase (PYK) promoter, env cDNA, and PYK terminator. The PYK promoter and 
terminator regions were derived from PYK cDNA isolated as described in Burke, et al, J Biol Chem (1983) 
258:2193-2201. 

25 The env fragment cloned into the expression cassette was derived from ARV-2 cDNA and comprises a 
1395 bp cDNA fragment which codes for env amino acid residues coded by nt 5857 to nt 7251 (Figure 2). 
In addition, there are 5 extra codons fused in reading frame in the 5' end, the first codon corresponding to a 
methionine, and 4 extra codons fused in reading frame at the 3' end followed by a stop codon. The extra 
codons were incorporated to facilitate cloning procedures exclusively. 

30 Figure 7 shows the coding strand of the nucleotide sequence cloned in pDPC303 and the amino acid 
sequence derived from it. DNA sequences that are not underlined in the figure were derived directly from 
the ARV-2 (9B) cDNA described above. All other sequences were either chemically synthesized or derived 
from the PYK vector. 

35 C. Preparation of 2150 (pDPC303) strain 

Yeast cells S. cerevisiae 2150-2-3 (Mat a , ade 1 , leu 2-04 , cir) were transformed as described by 
Hinnen et al (PNAS (1978) 75:1929-1933) and plated onto leu-~~selective plates. Single colonies were 
inoculated into leu- selective media and grown to saturation. Cells were harvested and the env protein was 
40 purified and characterized as described below. 

D. Purification and characterization of env protein. 

D.1. Cell breakage . 

45 

Frozen S. cerevisiae 2150-2-3 (pDPC303) are thawed and suspended in 1 volume of lysis buffer (1 
ug/ml pepstatin, 0.001 M PMSF, 0.001 M EDTA, 0.15 M NaCI, 0.05 M Tris-HCI pH 8.0), and 1 volume of 
acid-washed glass beads are added. Cells are broken in a non-continuous system using a 300 ml glass unit 
of Dyno Mill at 3000 rpm for 10 min. The jacket is kept cool by a -20 'C ethylene glycol solution. Glass 
so beads are decanted by letting the mixture set for 3 minutes on ice. The cell extract is recovered and 
centrifuged at 18,000 rpm (39,200 x g) for 35 min. The supernatant is discarded and the precipitate (pellet 
1) is further treated as indicated below. 

D.2. SDS extraction of insoluble material. 

55 ^ ~ ~~ 

Pellet 1 is resuspended in 4 volumes of Tris-HCI buffer (0.01 M Tris-HCI, pH 8.0, 0.01 M NaCI, 0.001 M 
PMSF, 1 ug/ml pepstatin, 0.001 M EDTA, 0.1% SDS) and extracted for 2 hr at 4°C with agitation. The 
solution is centrifuged at 6,300 x g for 15 min. The insoluble fraction (pellet 2) is resuspended in 4 volumes 
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(360 ml) of PBS (per liter: 0.2 g KCI, 0.2 g KH2PO4, 8.0 g Nad, 2.9 g Na2HP04.12H 2 0), 0.1% SDS, 0.001 
M EDTA, 0.001 M PMSF, 1 ug/ml pepstatin, and centrifuged at 6,300 x g for 15 min. The pellet (pellet 3) is 
suspended in 4 volumes of PBS, 0.2% SDS, 0.001 M EDTA, 0.001 M PMSF, 1 ug/ml pepstatin and is 
extracted for 12 hr at 4 # C with agitation on a tube rocker. The solution is centrifuged at 6,300 x g for 15 
5 min. The soluble fraction is recovered for further purification as indicated below. (The pellet can be 
reextracted by resuspending it in 4 volumes of 2.3% SDS, 5% 0-mercaptoethanol, and boiling for 5 min. 
After boiling, the solution is centrifuged at 6,300 x g for 15 min. The soluble fraction is recovered for further 
purification.) 

10 D.3. Selective precipitation and gel filtration . 

The soluble fraction is concentrated by precipitation with 30% ammonium sulfate at 4°C. The pellet 
(pellet 4) is resuspended in 2.3% SDS, 5% £-mercaptoethanol, and chromatographed on an ACA 34 (LKB 
Products) gel filtration column. The column is equilibrated with PBS, 0.1% SDS, at room temperature. 
;s Chromatography is developed in the same solution with a flow rate of 0.3 ml/min. Five ml fractions are 
collected, pooled and characterized by protein gel electrophoresis, Western analysis, and ELISA. If needed, 
pooled fractions are concentrated by vacuum dialysis on Spectrapor #2 (MW cutoff 12-14K). 

D.4. Characterization of recombinant env . 

20 

SDS polyacrylamide gel analysis (12% acrylamide gels) showed that a new 55,000 dalton protein was 
being synthesized in yeast cells transformed with the env-containing vector. The 55,000 dalton protein is 
absent from cells transformed with control plasmid (vector without env insert). The identity of env was 
confirmed by both ELISA (see 9.E.4.C) and Western analysis using AIDS patient serum. In both assays the 
25 55,000 dalton protein showed immunoreactivity. No reactivity was obtained with serum from a normal 
individual. 

Recombinant env was also expressed in mammalian (Cos) cells. 
14. Expression of p31 pol protein of ARV-2 in bacteria . 

30 

A. Host vector system 

The C-terminal region of the polymerase gene (p31 pol) is synthesized by E. coli strain D1210 
transformed with plasmid pTP31 .2. Plasmid pTP31 .2 is a pBR322 derivation which contains the sequence 
35 coding for p31 under transcriptional control of the hybrid tac promoter (described in 9.A). Expression of p31 
is induced in bacterial transformants by IPTG. 

B. Construction of pTP31 .2 . 

40 B.1 . Construction of M13 template 01100484 . 

A 5.2 kb DNA fragment was isolated from a Kpnl digest of ARV-2 (9b) containing the 3' end of the pol 
gene, orf-1, env and the 5* end of orf-2, that had been run on a 1% low melting point agarose (Sea-Pack) 
gel and extracted with phenol at 65 *C, precipitated with 100% ethanol and resuspended in TE. Eight ul of 

45 this material were further digested with Sstl for 1 hr at 37 °C in a final volume of 10 ul. After heat 
inactivation of the enzyme, 1.25 ul of this digest were ligated to 20 ng of M13mp19 previously cut with Kpnl 
and Sstl, in the presence of ATP and in a final volume of 20 ul. The reaction was allowed to proceed for 2 
hr at room temperature. Five ul of this mixture were used to transform competent E. coli JM101. Clear 
plaques were grown and single-stranded DNA was prepared as described in Messing and Vieira, Gene - 

50 (1982)19:269-276. " 

B.2. In Vitro mutagenesis of 01100484 . 

The DNA sequence in 01100484 was altered by site specific mutagenesis to generate a restriction site 
55 recognized by Ncol (CCATGG). An oligodeoxynucleotide that substitutes the A for a C at position 4299 
(Figure 2) and changes a T for an A at position 4305 (Figure 2) was synthesized using solid phase 
phosphoramidite chemistry. Both of these changes are silent in terms of the amino acid sequence, and the 
second one was introduced to decrease the stability of the heterologous molecules. The oligomer was 
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named ARV-216 and has the sequence: 5'-TTAAAATCACTTGCCATGGCTCTCCAATTACTG and corre- 
sponds to the non-coding strand since the M13 derivative template 01100484 is single-stranded and 
contains the coding strand. The 5' phosphorylated oligomer was annealed to the 01100484 M13 template at 
55 *C in the presence of 5' dephosphorylated M13 sequence-ing primer, 50 mM Tris-HCI pH 8, 20 mM KCI, 

5 7 mM MgCI 2 and 0.1 mM EDTA. The polymerization reaction was done in 100 ul containing 50 ng/ul DNA 
duplex, 150 uM dNTPs, 1 mM ATP, 33 mM Tris-acetate pH 7.8, 66 mM potassium acetate, 10 mM 
magnesium acetate, 5 mM DTT, 12.5 units of T4 polymerase, 100 ug/ml T4 gene 32 protein and 5 units of 
T4 DNA ligase. The reaction was incubate-ed at 30 • C for 30 min and was stopped by the addition of EDTA 
and SDS (10 mM and 0.2% respectively, final concentration). Competent JM101 E. coli cells were 

70 transformed with 1, 2, and 4 ul of a 1.:10 dilution of the polymerization product and plated into YT plates. 
Plaques were lifted by adsorption to nitrocellulose filters and denatured in 0.2 N MaOH, 1.5 M NaCI, 
followed by neutralization in 0.5 M Tris-HCI pH 7.3, 3 M NaCI and equilibrated in 6 x SSC. The filters were 
blotted dry, baked at 80 °C for 2 hr and preannealed at 37 'C in 0.2% SDS, 10 x Denhardfs, 6 x SSC. After 
1 hr, 7.5 million CPM of labeled ARV-216 were added to the filters and incubated for 2 additional hr at 

75 37 *C. The filters were washed in 6 x SSC at 42 °C for 20 min, blot-dried and used to expose film at -70 *C 
for 1 hr using an intensifying screen. Strong hybridizing plaques were grown and single-stranded DNA was 
prepared from them and used as templates for sequencing. Sequencing showed that template 01021785 
contains the Nco l site as well as the second substitution mentioned above. 

A second oligomer was synthesized to insert sites for Sail and EcoRI immediately after the termination 

20 codon of the pol gene (position 5101, Figure 2). This oligomer was called ARV-248 and has the sequence: 
5'-GGTGTTTTACTAAAGAATTCCGTCGACTAATCCTCATCC. Using the template 01020785, site specific 
mutagenesis was carried out as described above except that the filter wash after the hybridization was done 
at 65 • C. As above, 8 strong hybridizing plaques were grown and single-stranded DNA was sequenced. The 
sequence of template 01031985 shows that it contains the restriction sites for Ncol, Sail, and EcoRI as 

25 intended. 

B.3. Isolation of Ncol-EcoRI and Ncol-Sall DNA fragments that contain p31 . 

Replicative form (RF) of the 01031985 template was prepared by growing 6 clear plaques, each in 1.5 
30 ml of 2 x YT at 37 *C for 5 hr. Double-stranded DNA was obtained as described by Maniatis, et al, 
Molecular Cloning, a Laboratory Manual , Cold Spring Harbor, 1982, pooled and resuspended in 100 ul final 
volume. Ten ul of RF were digested with Ncol and EcoRI in a final volume of 20 ul. This fragment was 
used for direct p31 expression in bacteria. An additionaT20 ul of RF were cut with Ncol and Sail in 40 ul. 
This fragment was used for p31 expression in yeast. The samples were run on a~T% low melting point 
35 agarose (Sea-Pack) gel and the DNAs were visualized by fluorescence with ethidium bromide. The 800 bp 
bands were cut and the DNAs were extracted from the gel as mentioned above and resuspended in 10 ul 
of TE. The fragments were called ARV248NR#2 and ARV248NL, respectively. 

B.4. Cloning of p31 into plot7 . 

40 

The vector plot7 (3 ug) (Hallewell, et al, Nucl Acid Res (1985) 13, No. 6, pp. 2017-2034) was cut with 
Nco l and EcoRI in 40 ul final volume and the enzymes were heat-inactivated after 3 hr. Two ul of this 
digest were mixed with 2 ul of ARV248NR#2 and ligated in 20 ul in the presence of ATP and T4 DNA 
ligase at 14 'C overnight, and 10 ul of this mixture were used to transform competent D1210 cells. Colonies 
45 resistant to 2 mM IPTG and 100 ug/ml ampicillin were selected and supercoiled DNA was extracted from 
each of them. The DNAs were then restricted with Ncol and EcoRI and analyzed by agarose gel 
electrophoresis. Clones with the appropriate 800 bp Insert were~~se!ected for further use. They are 
designated pRSP248 numbers 3 and 4. 

so B.5. Construction of pTP31 , 

The Nco l site introduced into 01100485 is 52 bp downstream from the putative start of p31. Three 
oligomers were synthesized as above that code for the first 18 amino acids of p31 and generate a cohesive 
Nco l end at the 3 f end of the molecule. The 5' end of the molecule has been extended beyond the initiation 
55 codon to include a ribosome binding site. The oligomers that were synthesized have the sequences: 
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ARV-221.- 

CCCC C C 

5 1 AGGXAACAGAAAAATGATAGATAAGGCACAAGAA 
TTTT T 

5 

ARV-222.- 
5 1 GAACATGAGAAATATCACAGTAATTGGAGAGC 

ARV-223.- 

10 3 1 GGTGTTCTTCTTGTACTCTTTATAGTGTCATTAACCTCTCGGTAC 

One hundred fifty picomoles each of dephosphorylated ARV-211, phosphorylated ARV-222 and ARV 
223 were ligated to 20 ug of pRSP248 previously cut with Ncol, at 14° C for 18 hr in a final volume of 62 

75 ul. After phenol extraction and ethanol precipitation, the DNA was resuspended in 40 ul H 2 0 and incubated 
with 15 units of Klenow fragment in the presence of 0.5 mM dNTPs for 1 hr at room temperature. The 
sample was phenol extracted, ethanol precipitated, resuspended in 40 ul H 2 0, and digested with EcoRI. 
The DNA was then run on a low melting point agarose gel and the fragment of about 820 bp was extracted 
as described above and resuspended in a final volume of 20 ul of H 2 0. After phosphorylating the ends, 5 ul 

20 of the sample were incubated for 18 hr at 14*C with 150 ng of plot7 that had been cut with Pvull and EcoRI 
and its ends dephosphorylated, in the presence of T4 DNA ligase, ATP and in a final volume~of 31 ulTFive 
ul of ligation product were used to transform RRIdeltaM15. Clones resistant to 100 ug/ml of ampicillin were 
selected and supercoiled DNA was extracted from them. The DNAs were digested with Ncol and EcoRI and 
resolved on a 1% agarose gel. Colonies with the appropriate size insert were obtained~and namedpTP31. 

25 The p31 sequence contained in the insert is shown in Figure 8. Underlined sequences were chemically 
synthesized. Others were derived from DNA. 

C. Screening of transformants for specific proteins that react with AIDS sera . 

30 Bacterial transformants containing either the vector alone, or the vector with the p31 DNA (pTP31.2) 
were grown in L-broth with 0.02% ampicillin to an ODeso of 0.5. Cultures were induced by the addition of 
IPTG to a final concentration of 2 mM and grown for 3 more hr. Bacteria from 1 ml cultures were pelleted 
and resuspended in 200 ul of gel sample buffer. The cells were disrupted by three cycles of freezing and 
thawing, boiled, and the extracts loaded onto 12.5% polyacrylamide-SDS minigels. Proteins were elec- 

35 trophoresed and transferred to nitrocellulose by electroblotting. The nitrocellulose filters were reacted with 
serum EW5111 (diluted 1:100: positive reference serum from the CDC that reacts strongly with viral p31), 
horse radish peroxidase-conjugated goat anti-human IgG and HRP substrate. A prominent band at -30,000 
d and several lower molecular weight species were seen in gels of extracts from transformants with the p31 
DNA, but not in extracts from bacteria transformed with the vector alone. 

40 

D. Demonstration that the polypeptide from the C-terminal region of the pol gene is analogous to the viral 
p31 protein. — — 

Lysozyme-NP40 extracts were prepared from bacteria transformed with pTP31 .2 or vector alone. Five 
45 ml cultures were grown, the cells pelleted and resuspended in 1 ml of 50 mM Tris-HCI pH 8, 0.5 mM EDTA, 
1 mg/ml lysozyme and incubated at 0°C for 15 min NaCI, MgCI 2 , and NP40® were added to final 
concentrations of 0.4, 5 mM and 0.5% respectively, mixed and incubated with DNAse I (100 ug/ml) at 0°C 
for 30 min. When EW5111 serum (diluted 1:100) was preincubated with a 1:10 dilution of the cell extracts 
from bacteria transformed with pTP31.2, prior to reaction with a virus blot, the viral p3l band was 
so completely eliminated, while reactivity with other viral proteins remained unaffected. In contrast, extracts 
from bacteria transformed with the vector alone did not absorb out the p31 reactive antibodies. The viral 
p31 protein is thus the product of the C-terminal or endonuclease region of the pol gene of ARV-2. 



55 
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15. Expression of p31 pd protein in yeast . 

A. Construction of a yeast vector p31/GAP-ADH2: Cloning of p31 into pAB24 . 

5 The ARV248NL fragment was cloned into pBS100 previously cut with Ncol and Sail. pBSlOO is a 
bacterial vector derived from pAB12 with a BamHI cassette consisting of the GAP-ADH2 promoter, an ARV- 
env gene as an Ncol-Sall fragment, and the GAP terminator. The BamHI cassette from two positive clones 
of pBS1007p31/GAP-ADH2 was cloned into pAB24, a yeast vector with both ura and leu selection 
capabilities. Both orientations of the cassette in this vector were screened for and used to transform the 

70 yeast strain AB110 (Mat a , ura 3-52 , leu 2-04 , or both leu 2-3 and leu 2-112 , pep 4-3 , his 4-580 , cir*). 
These cells were plated in both ura- and leu- plates. Also, ura- cells were plated onto leu- plates. 

B. Induction of p31 expression. 

75 Three different induction procedures were done: 1 .- Ura- colonies patched on ura- plates were induced 
for 24 hr in YEP/1% glucose. Both a Western and a polyacrylamide gel were run on these samples. Both 
results were negative. 2.- Colonies from ura- plates patched on leu- plates were induced in either leu-/3% 
ethanol or YEP/1% glucose for 24 hr. A Western and a polyacrylamide gel were run on these samples and 
the results were also negative. 3.- Colonies from leu-plates patched on leu- plates were induced in either 

20 leu-/3% ethanol or YEP/1% glucose for 24 hr. The polyacrylamide gel showed a negative result. No 
Western was run on these samples. 

1 6. Expression of superoxide dismutase (SOD)-p31 fusion protein in ye ast. 

25 A. Construction of pCl/l-pSP31-GAP-ADH2 derivative . 

For the construction of a gene for a fused protein SOD-p31 to be expressed in yeast, a plasmid 
(pSI4/39-2) was used. This plasmid contains the SOD gene fused to the proinsulin gene under the 
regulation of the ADH-2/GAP promoter. The proinsulin gene is located between EcoRI and Sail restriction 
30 sites. To substitute the proinsulin gene with the ARV248NL fragment, two oligomers designated ARV-300 
and ARV-301, respectively, were synthesized using phosphoramidite chemistry. The sequences generate 
cohesive ends for Eco RI and Nco l on each side of the molecule when the two oligomers are annealed. 
ARV-300 and ARV-301 have the sequences: 

35 

ARV-300 5' AATTCAGGTGTTGGAGC 

GTCCACAACCTCGGTAC 5' ARV-301 

40 Two ug of pSI4/39-2 linearized with EcoRI were ligated to 100 picomoles each of phosphorylated ARV- 
300 and dephosphorylated ARV-301 in thepresence of ATP and T4 DNA ligase in a final volume of 35 ul. 
The reaction was carried out at 14 *C for 18 hr. The DNA was further digested with Sail and the fragments 
were resolved on a 1% low melting point agarose gel and a fragment containing thevector plus the SOD 
gene (-6.5 kb) was purified as described above and resuspended in 50 ul of TE. Five ul of this preparation 

45 were ligated to 5 ul of ARV248NL in 20 ul final volume for 18 hr at 14° C and 5 ul used to transform 
competent HB101 cells. The resultant plasmid was called pSP31. Twenty ug of this plasmid were digested 
with Bam HI and a fragment of about 2900 bp was isolated by gel electrophoresis, resuspended in TE and 
ligated to pCI/l previously cut with Bam HI. This DNA was used to transform HB101 and transformants with 
the Bam HI cassette were obtained. Yeast strains 2150, P017, and AB110 were transformed with this pCI/l- 

50 pSP31-GAP-ADH2 derivative, both short and long orientations. The strain 2150 gave no transformants. All 
other transformants were patched on leu- plates. 

B. Induction of pCI/l-pSP31-GAP-ADH2 . 

55 Three different kinds of inductions were tried: 1.- P017 colonies were induced in either a 10 ml culture 
of YEP/1% glucose or a leu:/3% ethanol culture for 24 hr. The yeast pellets were analyzed by both 
polyacrylamide gels and Westerns and even though the Coomassie-stained gel showed a negative result, 
the Western did light up a band of the correct molecular weight with both induction methods. 2.- P017 
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colonies were induced in a 30 ml culture of YEP/1% ethanol for 48 hr. Aliquots were analyzed by PAGE at 
various time points during the induction. The Coomassie-stained gel shows a band in the correct molecular 
weight range (47-50 kd) that appears after 14 hr in YEP/1% ethanol and reaches a maximum intensity at 24 
hr of induction. The Western result correlates well with the Coomassie-stained gel, showing strong bands at 
5 24 and 48 hr. 3.-AB110 colonies were induced in either leu-/3% ethanol or YEP/1% glucose for 24 hr. PAGE 
and Westerns were run and the results were negative for the PAGE and positive for the Western, in both 
induction methods. 

17. Purification and characterization of SOD-p31 from bacteria or yeast 
70 " ~~~ ~ 

Frozen bacteria (yeast) cells are thawed at room temperature and suspended in 1.5 volumes of lysis 
buffer (20 mM Tris-HCI, pH 8.0, 2 mM EDTA, 1 mM PMSF, for bacteria; 50 mM Tris-CI, pH 8.0, 2 mM 
EDTA, 1 mM PMSF for yeast), and mixed with 1 volume of acid-washed glass beads. 

Cells are broken for 15 min in a non-continuous mode using the glass chamber of a Dynomill unit at 
75 3,000 rpm, connected to a -20 °C cooling unit. Glass beads are decanted for 2-3 min on ice, the cell lysate 
is removed. The decanted glass beads are washed twice with 30 ml of lysis buffer at 4°C. The cell lysate is 
centrifuged at 39,000 x g for 30 min. 

The pellet obtained from the above centrifugation is washed once with lysis buffer, after vortexing and 
suspending it at 4 Q C (same centrifugation as above). The washed pellet is treated with 0.2% SDS (for 
20 bacteria) and 0.1% SDS (for yeast) in lysis buffer and is agitated by rocking at 4*C for 10 min. The lysate 
is centrifuged at 39,000 x g for 30 min. The pellet is boiled in sample buffer (67.5 mM Tris-CI. pH 7.0, 5% 
0-mercaptoethanol, 2.3% SDS) for 10 min and centrifuged for 10 min at 39,000 x g. The supernatant is 
recovered and further centrifuged at 100,000 x g for 60 min (60 Ti rotor). This step is replaced by a 0.45 
urn filtration when yeast is used. The supernatant from the above centrifugation is loaded (maximum 50 mg 
25 of protein) on a gel filtration column (2.5 x 90 cm, ACA 34 LKB) with a flow rate of 0.3-0.4 ml/min, 
equilibrated with phosphate-buffered saline (PBS), 0.1% SDS. The fractions containing SOD-p31 are pooled 
and concentrated either by vacuum dialysis or using a YM5 Amicon membrane at 40 psi. The protein is 
stored at -20 * C as concentrated solution. 

Gel electrophoresis analysis shows that the SOD-p31 protein migrates having a, molecular weight of 
30 about 46 kd and is over 90% pure. 

18. ELISA for antibodies to HIV-I using recombinant ARV-2 polypeptides 

Stock solutions of purified p25 gag protein (1.25 mg/ml in 20 mM sodium phosphate, .0.1% SDS, pH 
35 7.2), purified env protein (2 mg/ml in 20 mM sodium phosphate, 0.1% SDS, pH 7.2), and purified SOD-p31 
fusion protein (2 mg/ml in 20 mM sodium phosphate, 0.1 % SDS, pH 7.2) were prepared. 

For coating microtiter plates (Dynatech Immulon I), 1 part each of the stock solutions of p25 gag, env, 
and SOD-p31 were added to 997 parts of borate coating buffer (0.05 M borate, pH 9.0). One hundred 
microliters of the coating solution was added to each well, and the plates were covered and incubated 2 hr 
40 at 37 'C or 12 hr at 4'C. The coating solution was then aspirated from the wells and the plates washed 6 x 
with wash solution (0.137 M 0.8% NaCI, 0.05% Triton® X-100). 

Serum samples were diluted 1:100 in dilution solution (0.1% casein, 1 mM EDTA, 1% Triton® X-100, 
0.5 M NaCI, 0.01% thimerosal, pH 7.5) with yeast protein (strain AB103.1) extract (1:40 dilution, approxi- 
mately 2 mg protein per ml in PBS containing 1% Triton X-100, 2 mM PMSF, 0.01% thimerosal) and E. coli 
45 protein extract (1:40 dilution, approximately 1 mg protein per ml in PBS containing 1% Triton® X-100, 2 
mM PMSF, 0.01% thimerosal) added to the dilution solution. Extraction procedures were similar to those 
described in 13 and 14 above but using non-recombinant strains. One hundred microliters of diluted serum 
was added to each well and incubated 30 min at 37° C. The plates were then washed 6 x with wash 
solution. 

so Goat anti-human Ig labeled with horseradish peroxidase (Cappel) diluted 1:8000 in dilution solution 
without added yeast and E. coli extracts were added at 100 ul/well to the plates and incubated 30 min at 
37 # C. The plates were then washed 6 x with wash solution. Substrate solution (10 ml citrate buffer, 10.5 g 
citric acid/liter dH 2 0, pH to 4.0 with 6 M NaOH), 0.1 ml ABTS (15 mg/ml 2,2 , -azino-di-(3-ethyl-benz- 
thiazolene sulfonic acid) in dH 2 0) and 3.33 ul H2O2) at 100 ul/well was then added to the plates and the 

55 plates wrapped in foil and incubated at 37 *C for 30 min. The reaction was then stopped by adding 50 
ul/well of 10% SDS. Readings were made with a Dynatech ELISA reader set for dual wavelength reading: 
absorbance wavelength of 1 (410 nm) and reference wavelength of 4. 
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Results 

The following sera were tested: 

A. 89 consecutive blood donors from the Kansas City Blood Bank ("normal blood donors"): log nos. 
5 1001-1081,1085-1092. 

B. 52 sera from patients with lymphadenopathy syndrome (LAD) or AIDS or sexual partners of persons 
with LAD or AIDS (referred to as "contacts ")~al I obtained from UCSF AIDS Serum Bank panel: log nos. 
4601-4652. 

The positive/negative cut-off used was 5 x (average background signal - signal with diluent alone) and 
10 was determined to be 0.195. Thus, sera with signals below 0.195 were rated (-); those above were rated 
( + ). Each sample was also evaluated by the commercially available ABBOTT HTLV III EIA kit (Abbott Labs) 
and by Western analysis. 

Tests on the normal blood donor samples indicated all except one were negative in the invention ELISA. 
This normal serum scored negative in the ABBOTT HTLV III EIA test, but was actually positive, as 
75 confirmed by Western analysis. 
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The results of the tests on the 52 sera from LAD and AIDS patients and contacts are tabulated below: 





Serum 




ABBOTT 


Invention 




5 


No. 


Diaqnosis 


EIA 


EL ISA 




Weste 




4601 


Contacts 


4 


1.89 


4 


4 




02 


Contacts 


_ 


0.04 




«. 




03 


Contacts 


4 


1.44 


4 


4 


10 


04 


Contacts 


4 


1.92 


4 


4 




05 


Contacts 


~ 


0.04 


_ 


— 




06 


Contacts 


4 


>2 


4 


4 




07 


Contacts 


4 


. 1.37 


4 


4 




08 


Contacts 


4 


1.60 


4 


4 


15 


09 


Contact^ 


4 


>2 


4 


4 • 




10 


Contacts 


4 


>2 


4 


4 




11 


Contacts 


4 


1.94 


4 


4 




12 


Contacts 


+ 


>2 


4 


+ 




13 


Contacts 


4 


>2 


4 


4 


20 


14 


Contacts 


4 


>2 


4 


4-. 




15 


Contacts 


4 


1.97 


4 


4 




16 


AIDS 


4 


0.61 


4 


4 




17 


AIDS 


4 


>2 


4 


4 




18 


AIDS 


4 


>2 


4 


4 


25 


19 


AIDS 


+ 


1.58 


4 


4 




20 


AIDS 


4 


1.58 


4 


4- 




21 


AIDS 


4 


0.76 


4 


4 




22 


AIDS 


4 


1.74 


4 


4 




23 


LAD 


4 


1.26 


4 


4 


30 


24 


LAD 


• 4 


>2 


4 


4 




25 


AIDS 


4 


1.04 


4 


4 




26 


AIDS 


4 


1.24 


4 


4 




27 


AIDS 


4 


1.40 


4 


4> 




28 


AIDS 


_ 


0.07 






35 


29 


LAD 


4 


1.93 


4 


4 




30 


Contacts 


4 


1.96 


4 


4 




31 


AIDS 


4 


1.76 


4 


4- 




32 


AIDS 


4 


0.90 


4 


4 




33 


AIDS 


4 


1.69 


4 


4 


40 


34 


LAD 


+ 


1.09 


4 


4 




35 


AIDS 


4 


1.54 


4 


4 




36 


AIDS 


4 


1.22 


4 


4 




37 


AIDS 


4 


1.96 


4 


4 




38 


AIDS 




>2 


4 


4 


45 


3 9 


LAD 


4 


1.85 


4 


4 




40 


LAD 


4 


>2 


4 


4 




41 


LAD 


4 


0.84 


4 


4 




42 


LAD 


4 


1.59 


4 


. 4- 




43 


LAD 


4 


1.71 


4 


4 


50 


44 


AIDS 


4 


1.40 


4 


+ 




45 


LAD 


4 


>2 


4 


4 




46 


AIDS 


4 


1.38 


4 


4 
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Secum 


Diaqnosis 


ABBOTT 


Invention 




No . 


EIA 


ELISA 




Western 


47 


AIDS 




1.29 


+ 


+ 


48 


LAD 


+ 


1.93 


+ 


+ 


49 


LAD 


+/- 


0.48 


+ 


+ 


50 


LAD 




0.04 






51 


LAD 




0.07 






52 


LAD 


+ 


1.92 


+ 


+ 



10 

The above results show that the invention ELISA using recombinant ARV proteins is at least as good as 
the ABBOTT HTLV III EIA test or Western analysis. 

In the invention ELISA reported in this example the yeast and bacterial extracts were added to the 
75 serum to bind serum antibodies to yeast and bacteria to prevent such antibodies from binding to the 
recombinant ARV-2 proteins. Both yeast and bacterial extracts were required since the recombinant 
polypeptides included polypeptides expressed in yeast and polypeptides expressed in bacteria. If all the 
polypeptides were expressed in the same type of organism, only one extract would be needed. For 
instance, if a p25 gag polypeptide expressed in yeast was substituted for the bacterially produced p25 gag 
20 polypeptide of the example, only yeast extract would be added to the serum samples. 

19. Dot-blot assay for antibodies to HIV-I using recombinant ARV-2 polypeptides. 

Nitrocellular strips (0.5 x 5 cm) are spotted with 50 ng polypeptide in PBS (spotting volume 2 u*l). After 
25 spotting the strips are dried at room temperature for 1 hr or more. The strips are then post-coated in a 5% 
solution of Carnation non-fat dry milk in PBS, 0.01% Thimerosal, for 15-60 min at room temperature. Each 
test solution sample is diluted 1 :50 in 0.5 ml of the post-coating solution in a test tube. A post-coated strip 
is then placed in the tube and incubated in the sample with rocking at 37 °C for 1 hr. The strip is then 
removed from the tube and washed with post-coating solution. The strip is then incubated for 15 min at 
30 room temperature in goat anti-human Ig reagent labeled with horse radish peroxidase diluted 1:500 in post- 
coating solution. After incubation in the labeled antibody, the strip is washed serially with PBS, 1% Triton, 
and distilled water. The strips are developed by incubating them in substrate solution (see 23 above) for 15 
min at room temperature. 

Positive samples will cause a visually perceptible color change at the spotting site. Normal (negative) 
35 sera sample yield no color change or give a faint signal that is discernible from a positive signal. 
Competition assays may be run on sera giving faint signals to verify that they are negative. In the 
competition assay, polypeptide (10-25 ug/ml) is added to the test sample and incubated from 1 hr at 37 °C 
before the strip is incubated in the sample. With authentic positive sera the signal is completely blocked by 
the added polypeptide, whereas with normal (negative) sera there is no change in signal. 
40 Samples of organisms that express the above-described ARV-2 p25 gag and ARV-2 env polypeptides 
and the fusion protein of ARV-2 p31 and SOD were deposited at the American Type Culture Collection 
(ATCC), 12301 Parklawn Drive, Rockville, Maryland under the provisions of the Budapest Treaty. The 
accession numbers and dates of these deposits are listed below. 



Expression Product 


ATCC Accession No. 


Deposit Date 


ARV-2 p25 gag 
ARV-2 env 
ARV-2 p31/SOD 


53246 
20769 
20768 


27 August 1985 
27 August 1985 
27 August 1985 



Claims 

1. A recombinant DNA construct useful for the expression of a recombinant polypeptide in a cell 
containing the construct, the construct comprising control sequences which regulate transcription and 
translation of the recombinant polypeptide in the cell and a coding sequence regulated by the control 
sequences, wherein the coding sequence comprises a DNA sequence of at least about 21 bp in 
reading frame characterised in that the DNA sequence encodes an antigenic HIV-I amino acid 
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sequence of Figure 2 which sequence is immunologically non-cross-reactive with HTLV-I and HTLV-II 
and is reactive with HIV-I. 

2. A recombinant DNA construct according to claim 1, which is useful for expression in a eucaryotic cell. 

5 

3. A recombinant DNA construct according to claim 1, which is useful for expression in a yeast cell. 

4. A recombinant DNA construct according to claim 1 which is useful for expression in a bacterial cell. 

w 5. A recombinant DNA construct according to any one of claims 1 to 4, characterized in that the DNA 
sequence encodes an amino acid sequence from an env polypeptide of HIV-I. 

6. A recombinant DNA construct according to claim 5, wherein the DNA sequence encodes a complete 
env polypeptide. 

15 

7. A recombinant DNA construct according to any one of claims 1 to 4, characterised in that the DNA 
sequence encodes an amino acid sequence from a gag polypeptide of HIV-I. 

8. A recombinant DNA construct according to claim 7, wherein the DNA sequence encodes a complete 
20 gag polypeptide. 

9. A recombinant DNA construct according to any one of claims 1 to 4, characterised in that the DNA 
sequence encodes an amino sequence from a pol polypeptide of HIV-I. 

25 10. A recombinant DNA construct according to claim 9, wherein the DNA sequence encodes a complete 
pol polypeptide. 

11. A cell comprising a recombinant DNA construct according to any one of claims 1 to 10, wherein the 
cell expresses the antigenic HIV-I amino acid sequence and is free from other cells which do not 

30 express the antigenic HIV-I amino acid sequence. 

12. A cell according to claim 11, wherein the recombinant DNA construct comprises a replication system 
recognised by the cell. 

35 13. A cell according to claim 12 wherein the cell is eukaryotic. 

14. A cell according to any one of claims 1 1 to 13, which is a yeast. 

15. A method of producing a recombinant polypeptide comprising an antigenic HIV-I amino acid sequence 
40 wherein a population of cells according to claim 11 is cultured under conditions whereby the 

recombinant polypeptide is expressed. 

16. A method according to claim 15, wherein the cells are eukaryotic. 

45 17. A method according to claim 15, wherein the cells are yeast or bacteria. 

18. An immunoassay for detecting antibodies to HIV-I in a sample suspected of containing the antibodies, 
characterised in that at least one recombinant polypeptide is used to bind the antibodies and the 
recombinant polypeptide comprises an antigenic env, gag or pol HIV-I amino acid sequence contained 

50 in the sequence shown in Figure 2, which polypeptide is immunologically non-cross-reactive with 
HTLV-I and HTLV-II. 

19. An immunoassay according to claim 18, wherein at least one env amino acid sequence and one gag 
amino acid sequence are used to bind the antibodies. 

55 

20. An immunoassay according to claim 19, wherein at least one pol amino acid sequence is also used to 
bind the antibodies. 
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21. A diagnostic reagent, immunogen or vaccine capable of binding an anti-HIV-l antibody in human serum 
characterised in that said reagent, immunogen or vaccine consists of an antigen comprising an 
immunogenic fragment of at least seven amino acids of an HIV-I env, gag or pol polypeptide, which 
fragment is immunologically non-cross-reactive with HTLV-I and HTLV-II and which has a sequence 

5 contained in the sequence shown in Figure 2. 

22. A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant 
DNA construct according to claim 5. 

io 23^ A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant 
DNA construct according to claim 6. 

24. A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant 
- DNA construct according to claim 7. 

\5 

25. A recombinant polypeptide according to claim 24, wherein the gag amino acid sequence comprises 
p16gag. 

26. A recombinant polypeptide according to claim 24, wherein the gag amino acid sequence comprises 
20 p25 gag. ^ 

27. A recombinant polypeptide according to claim 24, wherein the gag amino acid sequence comprises a 
fusion protein of p16 gag and p25 gag amino acid sequences. 

25 28. A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant 
DNA construct according to claim 10. 

29. A recombinant polypeptide according to claim 28, wherein the pol amino acid sequence comprises p31 
pol. 

30 

30. A recombinant polypeptide according to claim 28, wherein the pol amino acid sequence comprises a 
fusion protein of superoxide dismutase and p31 pol amino acid sequences. 

31. An article of manufacture for use in an immunoassay for HIV-I. antibodies characterised in that it 
35 comprises a solid support having bound thereto a recombinant polypeptide according to claim 22. 

32. An article of manufacture for use in an immunoassay for HIV-I antibodies characterised in that it 
comprises a solid support having bound thereto recombinant polypeptide according to claim 24. 

40 33. An article of manufacture for use in an immunoassay for HIV-I antibodies characterised in that it 
comprises a solid support having bound thereto a recombinant polypeptide according to claim 28. 

34. A DNA sequence encoding a HIV-I polypeptide derived from a phage selected from ARV-2 (7D) (ATCC 
No. 40143), ARV-2 (8A) (ATCC No. 40144), and ARV-2 (9B) (ATCC No. 40158). 

45 

35. A recombinant DNA construct capable of expressing an antigenic recombinant HIV-I polypeptide 
derived from an organism selected from ATCC No. 53246, ATCC No. 20769 and ATCC No. 20768. 

36. An isolated polynucleotide comprising the ARV-2 sequence of Figure 2 or, a fragment of at least 21 bp 
so thereof, provided that said fragment of at least 21 bp is not 

i) a 3.5 kb viral insert from HTLV-I II recombinant clone BH5, 

ii) a 5.5 kb viral insert from HTLV-III recombinant clone BH8, or 

iii) a 9.0 kb viral insert from HTLV-III recombinant clone BH10 
disclosed in published European patent application EP-A1 -01 73529 

55 further provided that said fragment of at least 21 bp is not 

a) a 0.6 kbp viral insert from LAV designated LAV75, 

b) a 0.8 kbp viral insert from LAV designated LAV82, 

c) a 2.5 kbp viral insert from LAV designated LAV13, 
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d) a 9.1 to 9.2 kbp viral Insert from phage XJ19, 

e) a DNA fragment extending from Kpnl (6100) to approximately BamHI (8150) of XJ19, 

f) a DNA fragment extending from approximately Kpnl (3500) to approximately Bgllll (6500) of XJ19, 
or 

5 g) a DNA fragment extending from approximately Pst(800) to approximately Kpnl(3500) of XJ19 

disclosed in published European patent application EP-A1 -01 78978 
further provided that said fragment of at least 21 bp is not 

I a 2.3 kbp Kpnl : Kpnl fragment, 

II a 1 .0 kbp EcoRI-EcoRI fragment, or 
io III a 2.4 kbp EcoRI-Hindlll fragment 

disclosed in published European patent application EP-A2-01 85444. 

Patentanspruche 

75 1. Rekombinantes DNA-Konstrukt zur Expression eines rekombinanten Polypeptids in einer das Konstrukt 
enthaltenden Zelle, wobei das Konstrukt Kontrollsequenzen umfaBt, die die Transkription und die 
Translation des rekombinanten Polypeptids in der Zelle regulieren und eine von den Kontrollsequenzen 
regulierte codierende Sequenz, wobei die codierende Sequenz eine DNA-Sequenz von mindestens 
etwa 21 bp im Leserahmen umfaBt, dadurch gekennzeichnet, daB die DNA-Sequenz eine gegen HIV-I 

20 antigene Aminosauresequenz von Fig. 2 codiert, wobei die Sequenz mit HTLV-I und HTLV-II immunolo- 
gisch nicht kreuzreaktiv und mit HIV-I reaktiv ist. 

2. Rekombjnantes DNA-Konstrukt nach Anspruch 1 zur Expression in einer eukaryontischen Zelle. 

25 3. Rekombinantes DNA-Konstrukt nach Anspruch 1 zur Expression in einer. Hefezelle. 

4. Rekombinantes DNA-Konstrukt nach Anspruch 1 zur Expression in einer bakteriellen Zelle. 

5. Rekombinantes DNA-Konstrukt riach einem der AnsprOche 1 bis 4, dadurch gekennzeichnet, daB die 
30 DNA-Sequenz eine Aminosauresequenz eines env-Polypeptids von HIV-I codiert. 

6. Rekombinantes DNA-Konstrukt nach Anspruch 4, wobei die DNA-Sequenz ein komplettes env-Polypep- 
tid codiert. 

35 7. Rekombinantes DNA-Konstrukt nach einem der Anspruche 1 bis 4, dadurch gekennzeichnet, daB die 
DNA-Sequenz eine Aminosauresequenz von einem gag-Polypeptid von HIV-I codiert. 

8. Rekombinantes DNA-Konstrukt nach Anspruch 7, wobei die DNA-Sequenz ein komplettes gag-Polypep- 
tid codiert. 

40 

9. Rekombinantes DNA-Konstrukt nach einem der AnsprUche 1 bis 4, dadurch gekennzeichnet, daB die 
DNA-Sequenz eine Aminosauresequenz eines pol-Polypeptids von HIV-I codiert. 

10. Rekombinantes DNA-Konstrukt nach Anspruch 9, wobei die DNA-Sequenz ein komplettes pol-Polypep- 
45 tid codiert. 

11. Zelle, die ein rekombinantes DNA-Konstrukt nach einem der Anspruche 1 bis 10 enthalt, wobei die 
Zelle die gegen HIV-I antigene Aminosauresequenz exprimiert und frei von anderen Zellen ist, die die 
gegen HIV-I antigene Aminosauresequenz nicht exprimieren. 

50 

12. Zelle nach Anspruch 1 1 , wobei das rekombinante DNA-Konstrukt ein von der Zelle erkanntes Replika- 
tionssystem umfaBt. 

13. Zelle nach Anspruch 12, wobei die Zelle eukaryontisch ist. 

55 

14. Zelle nach einem der Anspruche 11 bis 13, wobei die Zelle eine Hefe ist. 
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15. Verfahren zur Herstellung eines rekombinanten Polypeptids, das eine gegen HIV-I antigene Aminosau- 
resequenz umfaBt, wobei eine Population von Zellen nach Anspruch 11 unter Bedingungen gezUchtet 
wird, unter denen das rekombinante Polypeptid exprimiert wird. 

5 16. Verfahren nach Anspruch 15, wobei die Zellen eukaryontisch sind. 

17. Verfahren nach Anspruch 15, wobei die Zellen Hefe Oder Bakterien sind. 

18. Immunassay zum Nachweisvon Antikorpern gegen HIV-I in einer Probe, von der angenommen wird, 
10 daB sie die Antikorper enthalt, dadurch gekennzeichnet, daB mindestens ein rekombinantes Polypeptid 

zur Bindung der Antikorper verwendet wird und das rekombinante Polypeptid eine antigene env-, gag- 
oder pol-HIV-l Aminosauresequenz umfaBt, die in der in Figur 2 gezeigten Sequenz enthalten ist, wobei 
das Polypeptid mit HTLV-I und HTLV-II immunologisch nicht kreuzreaktiv ist. 

75 19. Immunassay nach Anspruch 18, wobei mindestens eine env- Aminosauresequenz und eine gag- 
Aminosauresequenz zur Bindung der Antikorper verwendet werden. 

20. Immunassay nach Anspruch 19, wobei mindestens eine pol-AminosMuresequeriz zur Bindung der 
Antikorper zusatzlich verwendet wird. 

20 

21. Diagnostisches Reagenz, Immunogen oder Impfstoff,, fahig zur Bindung eines anti-HIV-l-Antikorpers in 
menschlichem Serum, dadurch gekennzeichnet, daB das Reagenz, Immunogen Oder der Impfstoff aus 
einem Antigen besteht, das ein immunogenes Fragment von . mindestens sieben Aminosauren eines 
HIV-I env-, gag- Oder pol-Polypeptids umfaBt, wobei das Fragment mit HTLV-I und HTLV-II immunolo- 

25 gisch nicht kreuzreaktiv ist und einein der in Fig. 2 gezeigten Sequenz enthaltene Sequenz besitzt. 

22. Rekombinantes Polypeptid, dadurch gekennzeichnet, daB es von einer Zelle hergestellt wird, die mit 
einem rekombinanten DNA-Konstrukt nach Anspruch 5 transformiert ist. 

30 23. Rekombinantes Polypeptid , dadurch gekennzeichnet, daB es von einer Zelle hergestellt wird, die mit 
einem rekombinanten DNA-Konstrukt nach Anspruch 6 transformiert ist. 

24. Rekombinantes Polypeptid, dadurch gekennzeichnet, daB es von einer Zelle hergestellt wird, die mit 
einem rekombinanten DNA-Konstrukt nach Anspruch 7 transformiert ist. 

35 

25. Rekombinantes Polypeptid nach Anspruch 24, wobei die gag-Aminosauresequenz p16 gag umfaBt. 

26. Rekombinantes Polypeptid nach Anspruch 24, wobei die gag-Aminosauresequenz p25 gag umfaBt. 

40 27. Rekombinantes Polypeptid nach Anspruch 24, wobei die gag-Aminosauresequenz ein Fusionsprotein 
aus p16 gag und p25 gag Aminosauresequenzen umfaBt. 

2a Rekombinantes Polypeptid, dadurch gekennzeichnet, daB es in einer Zelle hergestellt wird, die mit 
einem rekombinanten DNA-Konstrukt nach Anspruch 10 transformiert ist. 

45 

29. Rekombinantes Polypeptid nach Anspruch 28, wobei die pol-Aminosauresequenz p31 pol umfaBt. 

30. Rekombinantes Polypeptid nach Anspruch 28, wobei die pol-Aminosauresequenz ein Fusionsprotein 
aus Superoxiddismutase und der p31 pol-Aminosauresequenz umfaflt. 

50 

31. Erzeugnis zur Verwendung in einem Immunassay fur HIV-l-Antikorper, dadurch gekennzeichnet, daB es 
eine teste Unterlage umfaBt, an die ein rekombinantes Polypeptid nach Anspruch 22 gebunden ist. 

32. Erzeugnis zur Verwendung in einem Immunassay fur HIV-l-Antikorper, dadurch gekennzeichnet, daB es 
55 eine teste Unterlage umfaBt, an die ein rekombinantes Polypeptid nach Anspruch 24 gebunden ist. 

33. Erzeugnis zur Verwendung in einem Immunassay fiir HIV-l-Antikorper, dadurch gekennzeichnet, daB es 
eine teste Unterlage umfaBt, an die ein rekombinantes Polypeptid nach Anspruch 28 gebunden ist. 
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34. DNA-Sequenz, die ein HIV-l-Polypeptid codiert und von dem Phagen ARV-2 (7D) (ATCC Nr. 40143), 
ARV-2 (8A) (ATCC Nr. 401 44) Oder ARV-2 (9B) (ATCC Nr. 401 58) stammt. 

35. Rekombinantes DNA-Konstrukt fahig zur Expression eines antigenen rekombinanten HIV-l-Polypeptids, 
5 das aus einem Organismus mit der ATCC Nr. 53246, ATCC Nr. 20769 oder ATCC Nr. 20768 stammt. 

36. Isoliertes Polynucleotid, das die ARV-2-Sequenz aus Figur 2 oder ein Fragment mit mindestens 21 bp 
davon umfaBt, vorausgesetzt, daB das Fragment mit mindestens 21 bp nicht ist 

i) eine virale 3,5 kb Insertion aus dem rekombinanten HTLV-III Clon BH5, 
io ,ii) eine virale 5,5 kb Insertion aus dem rekombinanten HTLV-III Clon BH8, oder 

iii) eine virale 9,0 kb Insertion aus dem rekombinanten HTLV-III Clon BH10, 
offenbart in der veroffentlichten europaischen Patentanmeldung EP-A1 0 173 529 
ferner vorausgesetzt, daB das Fragment mit mindestens 21 bp nicht ist 
a) eine virale 0,6 kbp Insertion aus mit LAV75 bezeichnetem LAV, 
75 b) eine virale 0,8 kbp Insertion aus mit LAV82 bezeichnetem LAV, 

c) eine virale 2,5 kbp Insertion aus mit LAV13 bezeichnetem LAV, 

d) eine virale 9,1 bis 9,2 kbp Insertion aus dem Phagen XJ19, 

e) ein sich von Kpnl (6100) bis ungefahr BamHI (8150) von XJ19 erstreckendes DNA-Fragment, 

0 ein sich von ungefahr Kpnl (3500) bis ungefShr Bglll (6500) von XJ19 erstreckendes DNA- 
20 Fragment, oder 

g) ein sich von ungefahr Pstl (800) bis ungefahr Kpnl (3500) von XJ19 erstreckendes DNA-Fragment 
offenbart in der europaischen Patentanmeldung EP-A1-0 178 978 
ferner vorausgesetzt, daB das Fragment mit mindestens 21 bp nicht ist - 

1 ein 2,3 kbp Kpnl-Kpnl-Fragment, 

25 II ein 1,0 kbp EcoRI-EcoRI-Fragment, oder 

III ein 2,4 kbp EcoRI-Hindlll-Fragment, 
offenbart in der veroffentlichten europaischen Patentanmeldung EP-A2-0 185 444. 

Revendlcations 

30 

1. Produit d'assemblage d'ADN recombinant, utile pour Pexpression d'un polypeptide recombinant dans 
une cellule contenant le produit d'assemblage, le produit d'assemblage comprenant des sequences 
regulatrices qui reglent la transcription et la traduction du polypeptide recombinant dans la cellule et 
une sequence codante r£gul6e par les sequences regulatrices, dans lequel la sequence codante 
35 comprend une sequence d'ADN d'au moins environ 21 pb dans le cadre de lecture, caracteVise* en ce 
que la sequence d'ADN code pour une sequence d'aminoacides antigenique du HIV-I de la figure 2, 
laquelle sequence est immunologiquement non reactive en reaction croisee avec le HTLV-I et le HTLV- 
II et est reactive avec le HIV-I. 

40 2. Produit d'assemblage d'ADN recombinant selon la revendication 1, qui est utilisable pour I'expression 
dans une cellule eucaryote. 

3. Produit d'assemblage d'ADN recombinant selon la revendication 1, qui est utilisable pour I'expression 
dans une cellule de levure. 

45 

4. Produit d'assemblage d'ADN recombinant selon la revendication 1 , qui est utilisable pour I'expression 
dans une cellule bacterienne. 

5. Produit d'assemblage d'ADN recombinant selon Tune quelconque des revendications 1 a 4, caracteVise 
50 en ce que la sequence d'ADN code pour une sequence d'aminoacides provenant d'un polypeptide env 

de HIV-I. 

6. Produit d'assemblage d'ADN recombinant selon la revendication 4, dans lequel la sequence d'ADN 
code pour un polypeptide env complet. 

55 

7. Produit d'assemblage d'ADN recombinant selon Tune quelconque des revendications 1 a 4, caracterise 
en ce que la sequence d'ADN code pour une sequence d'aminoacides provenant d'un polypeptide gag 
de HIV-I. 
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8. Produit d'assemblage d'ADN recombinant selon la revendication 7, dans lequel la sequence d'ADN 
code pour un polypeptide gag complet. 

9. Produit d'assemblage d'ADN recombinant selon Tune quelconque des revendications 1 a 4, caracterise 
5 en ce que la sequence d'ADN code pour une sequence d'aminoacides provenant d'un polypeptide pol 

de HIV-I. 

10. Produit d'assemblage d'ADN recombinant selon la revendication 9, dans lequel la sequence d'ADN 
code pour un polypeptide pol complet. 

70 

11. Cellule comprenant un produit d'assemblage d'ADN recombinant selon I'une quelconque des revendi- 
cations 1 a 10, dans laquelle la cellule exprime la sequence antigenique d'aminoacides du HIV-I et est 
separee d'autres cellules qui n'expriment pas la sequence antigenique d'aminoacides du HIV-I. 

75 12. Cellule selon la revendication 11, dans laquelle le produit d'assemblage d'ADN recombinant comprend 
un systeme de replication reconnu par la cellule. 

13. Cellule selon la revendication 12, caract§ris6e en ce que la cellule est eucaryote. 

20 14. Cellule selon I'une quelconque des revendications 11 a 13, qui est une levure. 

15. Proc£de* de production d'un polypeptide recombinant comprenant une sequence antigenique d'aminoa- 
cides du HIV-I, dans lequel on cultive une population de cellules selon la revendication 11, dans des 
conditions dans lesquelles le polypeptide recombinant est exprime. 

25 

16. Proc6de selon la revendication 15, dans lequel les cellules sont eucaryotes. 

17. Procede selon la revendication 15, dans lequel les cellules sont des bacteries ou des levures. 

30 18. Essai immunologique pour la detection d'anticorps contre le HIV-I dans un £chantillon presume* contenir 
les anticorps, caracteVise* en ce qu'au moins un polypeptide recombinant est utilise* pour fixer les 
anticorps et le polypeptide recombinant comprend une sequence antigenique d'aminoacides env, gag 
ou pol du HIV-I contenue dans la sequence representee sur la figure 2, lequel polypeptide est 
immunologiquement non r£actif en reaction crois£e avec le HTLV-I et le HTLV-II. 

35 

19. Essai immunologique selon la revendication 18, dans lequel au moins une sequence d'aminoacides env 
et une sequence d'aminoacides gag sont utilises pour fixer les anticorps. 

20. Essai immunologique selon la revendication 19, dans lequel au moins une sequence d'aminoacides pol 
40 est egalement utilisee pour fixer les anticorps. 

21. Rgactif de diagnostic, immunogene ou vaccin capable de fixer un anticorps anti-HIV-l dans du seVum 
humain, caracterise* en ce que ledit reactif consiste en un antigene comprenant un fragment immunoge- 
ne d'au moins sept aminoacides d'un polypeptide env, gag ou pot du HIV-I, lequel fragment est 

45 immunologiquement non reactif en reaction croisee avec le HTLV-I et le HTLV-II et qui presente une 
sequence contenue dans celle representee a la Figure 2. 

22. Polypeptide recombinant, caracterise en ce qu'il est produit par une cellule transformed par un produit 
d'assemblage d'ADN recombinant selon la revendication 5. 

50 

2a Polypeptide recombinant, caracterise en ce qu'il est produit par une cellule transformed par un produit 
d'assemblage d'ADN recombinant selon la revendication 6. 

24. Polypeptide recombinant, caracterise* en ce qu'il est produit par une cellule transformee par un produit 
55 d'assemblage d'ADN recombinant selon la revendication 7. 

25. Polypeptide recombinant selon la revendication 24, dans lequel la sequence d'aminoacides gag 
comprend gag-p16. 
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26. Polypeptide recombinant selon la revendication 24, dans lequel la sequence d'aminoacides comprend 
gag-p25. 

27. Polypeptide recombinant selon la revendication 24, dans lequel la sequence d'aminoacides gag 
5 comprend une proteine de fusion de sequences d'aminoacides gag-p16 et gag-p25. 

2a Polypeptide recombinant, caracteVise" en ce qu'il est produit par une cellule transformed par un produit 
d'assemblage d'ADN recombinant selon la revendication 10. 

w 29. Polypeptide recombinant selon la revendication 28, dans lequel la sequence d'aminoacides pol 
comprend pol-p31 . 

30. Polypeptide recombinant selon la revendication 28, dans lequel la sequence d'aminoacides pol 
comprend une proteine de fusion de sequences d'aminoacides pol-p31 et de la superoxyde dismutase. 

75 

31. Article de fabrication pour utilisation dans un essai immunologique pour la detection d'anticorps contre 
le HIV-I, caracterise en ce qu'il comprend un support solide sur lequel est fixe un polypeptide 
recombinant selon la revendication 22. 

20 32. Article de fabrication pour utilisation dans un essai immunologique pour la detection d'anticorps contre 
le HIV-I, caracterise en ce qu'il comprend un support solide sur lequel est fixe un polypeptide 
recombinant selon la revendication 24. 

33. Article de fabrication pour utilisation dans un essai immunologique pour la detection d'anticorps contre 
25 le HIV-I, caracterise en ce qu'il comprend un support solide sur lequel est fixe un polypeptide 

recombinant selon la revendication 28. 

34. Sequence d'ADN codant pour un polypeptide de HIV-I derive d'un phage choisi parmi ARV-2 (7D) 
(ATCC n • 40143), ARV-2 (8A) (ATCC n • 40144) et ARV-2 (9B) (ATCC n • 40158). 

30 

35. Produit d'assemblage d'ADN recombinant, capable d'exprimer un polypeptide de HIV-I recombinant 
antigenique, provenant d'un organisme choisi parmi ATGC n° 53246, ATCC n* 20769 et ATCC n* 
20768. 

35 36. Polynucleotide isole" comprenant la sequence de ARV-2 de la figure 2 ou un fragment d'au moins 21 pb 
de celle-la, etant entendu que ledit fragment d'au moins 21 pb n'est pas 

I) un segment d'insertion viral de 3,5 kb provenant du clone recombinant BH5 de HTLV-III, 

II) un segment d'insertion viral de 5,5 kb provenant du clone recombinant BH8 de HTLV-III, 

III) un segment d'insertion viral de 9,0 kb provenant du clone recombinant BH10 de HTLV-III, 
40 decrits dans EP-A1 -0 1 73 529, 

etant en outre entendu que ledit fragment d'au moins 21 pb n'est pas 

a) un segment d'insertion viral de 0,6 kpb provenant de LAV, d£nomme* LAV75, 

b) un segment d'insertion viral de 0,8 kpb provenant de LAV, denomme* LAV82, 

c) un segment d'insertion viral de 2,5 kpb provenant de LAV, denomme LAV13, 
45 d) un segment d'insertion viral de 9,1 a 9,2 kpb provenant du phage XJ19, 

e) un fragment d'ADN s'&endant de Kpnl (6100) a approximativement BamHI (8150) de XJ19, 

f) un fragment d'ADN s'etendant d'approximativement Kpnl (3500) a approximativement Bgllll (6500) 
de XJ19 ni 

g) un fragment d'ADN s'etendant d'approximativement Pstl (800) a approximativement Kpnl (3500) 
so de\J19 

decrits dans EP-A1-0 178 978, 

§tant en outre entendu que ledit fragment d'au moins 21 pb n'est pas 

I) un fragment Kpnl-Kpnl de 2,3 kpb, 

II) un fragment EcoRI-EcoRI de 1 ,0 kpb ni 
55 III) un fragment EcoRI-Hindlll de 2,4 kpb 

decrits dans EP-A2-0 185 444. 
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hetlleAspLysAlaGlnGluGluHisGluLysTyrHisSerAsnTrp 
1 AGGXAACAG: : I x AT 6 AT 1 6 A : AAG6CACAAGAA6AACATGAGAAAT AT CACA6TA ATTGC5 
TCCXTTGTC; ; ; xTACTAiCTs TTCCGTGTTCTTCTT6TACTCTTTATAGTGTCATTAACC 

32 wboll , 36 nlalll , 

ArqAlaMetAlaSerAspPheAsnLeuProProValValAlaLysGluI leValAlaSer 
62 AGAGCCAT6GCTAGTGATTT T AACCTGCCACCTGTAGTAGCAAAAGAAAT AGT AGCCAGC 
T CT CGGT ACCGA TCACTAAAAT T GGACG6TGGACAT CATCGTTTTCT T TAT CATC6GTCG 

66 ncol, 67 nlalll, 116 nspBll pvull, 119 alul , 

CysAspLysCysGl nLeuLysGl yGluAl aMetHlsGl yGlnVa 1 AspCysSerProG 1 y 
122 TGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATG6ACAAGTAGACTGTAGTCCAGGA 
ACACTATTT ACAGT CGATTT T C CT CTTCGGT ACGT ACCTGTTCATCTGACAT C AGGTCCT 

135 alul, 151 nlalll, 152 nsil ava3, 155 nlalll, 164. accl , 1 
76 apyl bstXl ecorl 1 scrFl , 

1 leTrpGlnLeuAspCysThtHi5LeuGlu61yLy5lleIleLeuValAlaValHisVal 
1B2 AT ATGbCAACT AGAT 7 & TACA C AT CT AGAAGGAAAAATTATCCT6GTAGCAGT T C ATGTA 
T AT ACCGTTGAT CTAACATGTGT AGATCTTCCTTTTT AAT AGGACCAT CGTCAAGTACAT 

198 rial, 205 xbal, 223 apyl ecorll scrFl , 236 nlalll, 

AlaSerGlyTyrlleGluAlaGluVallleProAlaGluThrGlyGlnGluThrAlaTyr 
242 GCCAGTGGATATAT AGAA6C AGAAGT T AT TCCAGCAGAGACA6GGCA6GAAACAGCATAT 
CGGTCACCTATATATCTTCGTCTTCAAT AAGGTCGTCTCTGTCCCGTCCTTTGTC6TAT A 

263 xtnnl, 

PheLeuLeuLysLeuAl aGl yArqTrpProVa ILysThrl 1 eHi sThr AspAsnG 1 ySer 
302 T T T CT CT T AAAATTAGCAGGAAGATGGCCAGT AA AAACAAT ACAT ACAGACAAT6GCAGC 
AAAGAGAATTTTAAT CGTCCTT CT AC CGGT CAT T TTTGTT ATGT AT GTCTGTT ACCGT CG 

321 mboll, 326 ball cfrl hael , 327 haelll, 357 bbv InuAhl, 

AsnPheThrSerThrThrValLysAlaAlaCysTrpTrpAlaGlylleLysGlnGluPhe 
362 AATTTCACCAGTACTACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCAGGAATTT 
T T AAAGTGGT CATGATGCC AAT T CC6GCGGACAACCACCCGTCCCTAGTT CGT CCTTAAA 

366 hph, 371 seal, 372 rsal, 385 haelll, 386 inuAht nsbll, 4 
05 binl, 406 dpnl sau3a, 

GlylieProTyrAsnProGln5erGln61yValVal61u5erf1etAsriAsnGluLeuLys 
4 22 GGCATTCCCTACAATCCCCAAAGT CAAGGAGT AGT AGAAT CTATGAATAA TGAATTAAAG 
C CGT AAGG6ATGTT AGGGGT T T CAGTTCCTCATCATCTTAGATACTTATTACTTAATTTC 

423 bsml , 458 hinf 1 , 

Lysl lei leGlyGlnValArgAspGlnAlaGluHisLeuLysThrAlaValGlnMetAl a 
AAAATTATAGGACAGGTAAGAGAYCAGGCTGAACACCTTAAGACAGCAGTACAAATGGCA 
TTTTAATATCCTGTCCATTCT CT AGT CC6ACTTGTG6AAT TCT6T CGT CATGT TT ACCGT 

503 dpnl sau3a, 518 aflll, 530 rsal, 

ValPhel leHisAsnPheLysAtoLysGlyGlyl 1 eGl yG 1 y TyrSer A 1 aG 1 yG 1 uArg 
GTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGGATACAGTGCAGGGGAAAGA 
C AT AAGTAGGTGTT AAAATT TTCTTTTCCCCCCTAACCCCCTATGTCACGTCCCCTTTCT 

547 f okl , 557 ahalll , 

1 leValAspIlelleAlaThrAspIleGinThrLysGluLeuGlnLysGlnlleThrLys 
AT AGT AGAC AT AAT AGCAA C AGACAT ACAAACT AAAGAACT ACAAAAGCAAAT T ACAAA A 
T AT CAT CTGT ATTATCGTTGT CTGTATGTTTGATTTCTTGATGTTTT CGTTTAAT GTT T T 



482 



542 



602 



662 



605 acd , 

I 1 eGl nAsnPheArgValTyrTyrArgAspAsnLysAspProLeuTrpLysGl yProA 1 a 
A IIJ05$ ATTTTC6e6TTTATT A CAG S 6A ^ AA CAAA6ATCCCCTTTGGAAAGGACCAGCA 
T AAGTT TT AAAAGCCCAAAT AATGT CCCTGTTGTT TCT AGGGGAAACCT TT CC TGG T CGT 

697 Kho2, 698 dpn! sau3a, 713 asul ava2, 
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Description 

[0001] This invention is in the field of genetic engineering. More particularly, it relates to recombinant viral proteins 
associated with lymphadenopathy syndrome and/or acquired immune deficiency syndrome. 

5 [0002] With the discovery of human T-cell lymphotropic Virus-I (HTLV-I) as an infectious agent in humans, it was 
established that retroviruses could infect humans and could be the etiological agent of disease. After HTLV-I was 
established, a second retrovirus of the same family, HTLV-II was found in a hairy cell leukemia established strain. Since 
that time, other human retroviruses have been isolated which are associated with lymphadenopathy syndrome (LAS) 
and/or acquired immune deficiency syndrome (AIDS) victims. Various retroviruses have been isolated from individuals 

10 with AIDS (sometimes called HTLV-III) or LAS (sometimes called LAV). See for example, Barre-Sinoussi, et al, Science 
(1983) 220:868-871 and Montagnier, et al, Cold Spring Harbor Symposium (1984) In press; Vilmer, et al, Lancet (1984) 
1:753, Popovic, et al, Science - (1984) 224:497 and Gallo, et al, Science (1984) 224:500. A comparison of HTLV-III 
and LAV may be found in Feorino, et al, (1 984), supra. See also, Klatzman, et al, Science (1 984) 225:59-62, Montagnier, 
et al, ibid (1984) 63-66, and the references cited therein for a survey of the field. A general discussion of the T-cell 

15 leukemia viruses may be found in Marx, Science (1984) 22:475-477. Levy, et al, Science (1 984) 225:840-842 report 
the isolation of ARV (AIDS-associated retroviruses). 

[0003] At the time of filing this application, these viruses (HTLV-III, LAV, and ARV) were generically referred to as 
human T-cell lymphotropic retrovirus (hTLR). However, from 1986 onwards the equivalent generic term "human im- 
mundeficiency virus" (HIV) was adopted as the recognised term for such viruses. Subsequently, a sub-division of the 

20 generic group HIV was necessary into HIV-I and HIV-ll. Since the application relates to ARV-2 isolates which are HIV-I 
isolates, HIV-I has been substituted throughout the application for hTLR (HIV) and the claims are accordingly limited 
to HIV-I. The HIVs (hTLRS) may be shown to be of the same class by being similar in their morphology, serology, 
reverse transcriptase optima and cytopathology, as identified in the above references. For example, the reverse tran- 
scriptase prefers Mg +2 , and has a pH optima of about 7.8. 

25 [0004] DNA clones containing HIV sequences are disclosed in EP-A1 -01 73529, EP-A1 -01 78978, EP-A2-01 85444 
and WO 86/02383. 

[0005] The present invention is defined in the claims. 
[0006] Figure 1 is a restriction map of proviral DNA (ARV-2). 

[0007] Figure 2 is the nucleotide sequence of ARV-2(9B). The amino acid sequences for the products of the gag, 
30 pol, and env genes are indicated. The U3, R, and U5 regions of the LTRs are also designated. The cap site is position 
+1 . A 3 bp inverted repeat at the ends of the LTR, the TATA box at position -29, the sequence complementary to the 
3'-end of the tRNA'vs at position 183, and the polyadenylation signal at position 9174 are underlined. The overlines 
indicate the amino sequences determined from virion proteins. The nucleotides at the beginning of each line are num- 
bered, and the amino acids at the end of each line are indicated. 
35 [0008] Figure 3 is a flow diagram showing the procedures for making the plasmid pGAG25-1 0. 

[0009] Figure 4 is the nucleotide sequence of the p25 gag gene cloned in plasmid pGAG25-10 and the amino acid 
sequence encoded by that gene. 

[001 0] Figure 5 is the coding strand of the nucleotide sequence cloned in pG AG41 -1 0 for producing the fusion protein 
p41 gag and the corresponding amino acid. 
40 [0011] Figure 6 is a nucleotide sequence coding for ARV-2 p16 gag protein that was cloned into plasmid ptac5 to 
make an expression plasmid for producing p16 gag protein in bacteria. 

[001 2] Figure 7 is a nucleotide sequence that encodes ARV-2 env protein that was used to prepare plasmid pDPC303. 
[0013] The HIV-I DNA sequences, either isolated and cloned from proviral DNA or cDNA or synthesized, may be 
used for expression of polypeptides which may be a precursor protein subject to further manipulation by cleavage, or 

45 a complete mature protein or fragment thereof. The smallest sequence of interest, so as to provide a sequence encoding 
an amino acid sequence capable of specific binding to a receptor, e.g., an immunoglobulin, will be 21 bp, usually at 
least 45 bp, exclusive of the initiation codon. The sequence may code for any greater portion of or the complete polypep- 
tide, or may include flanking regions of a precursor polypeptide, so as to include portions of sequences or entire se- 
quences coding for two or more different mature polypeptides. The sequence will usually be less than about 5 kbp, 

so more usually less than about 3 kbp. 

[0014] Sequences of particular interest having open reading frames (Figure 2) define the structural genes for the 
gag proteins (p16 and p25) and the env protein. It is to be understood that the above sequences may be spliced to 
other sequences present in the retrovirus, so that the 5*-end of the sequence may not code for the N-terminal amino 
acid of the expression product. The splice site may be at the 5-terminus of the open reading frame or internal to the 

55 open reading frame. The initiation codon for the protein may not be the first codon for methionine, but may be the 
second or third methionine, so that employing the entire sequence indicated above may result in an extended protein. 
However, for the gag and env genes there will be proteolytic processing in mammalian cells, which processing may 
include the removal of extra amino acids. 
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[001 5] In isolating the different domains the provirus may be digested with restriction endonucleases, the fragments 
electrophoresed and fragments having the proper size and duplexing with a probe, when available, are isolated, cloned 
in a cloning vector, and excised from the vector. The fragments may then be manipulated for expression, Superfluous 
nucleotides may be removed from one or both termini using Bal31 digestion. By restriction mapping convenient re- 

5 striction sites may be located external or internal to the coding region. Primer repair or in vitro mutagenesis may be 
employed for defining a terminus, for insertions, deletion, point or multiple mutations, or the like, where codons may 
be changed, either cryptic or changing the amino acid, restriction sites introduced or removed, or the like. Where the 
gene has been truncated, the lost nucleotides may be replaced using an adaptor. Adaptors are particularly useful for 
joining coding regions to ensure the proper reading frame. 

io [0016] The env domain of the HIV-I genome can be obtained by digestion of the provirus with EcoRI and Kpn l and 
purification of a 3300 base pair (bp) fragment, which fragment contains about 400 bp of 5'non-coding and about 200 
bp of 3' non-coding region. Three different methionines coded for by the sequence in the 5' end of the open reading 
frame may serve as translational initiation sites. 

[0017] Digestion of proviral sequences with Sacl and EcoRV provides a fragment of about 2300 bp which contains 
*5 the gag domain and a second small open reading frame towards the 3" end of the gag region. The gag domain is about 
1500 bp and codes for a large precursor protein which is processed to yield proteins of about 25,000 (p25), 16,000 
(p16) and 12,000 (p12) daltons. Digestion with Sac l and BaJII may also be used to obtain exclusively the gag domain 
with p12, p25 and partial p16 regions. 

[0018] The polypeptides which are expressed by the above DNA sequences may find use in a variety of ways. The 
20 polypeptides or immunologically active fragments thereof, may find use as diagnostic reagents, being used in labeled 
or unlabeled form or immobilized (i.e., bound to a solid surface), as vaccines, in the production of monoclonal antibodies, 
e.g., inhibiting antibodies, or the like. 

[001 9] The DNA sequences may be joined with other sequences, such as viruses, e.g., vaccinia virus or adenovirus, 
to be used for vaccination. Particularly, the DNA sequence of the viral antigen may be inserted into the vaccinia virus 
25 at a site where it can be expressed, so as to provide an antigen of HIV-I recognized as an immunogen by the host. 
The gag or env genes or fragments thereof that encode immunogens can be used. 

[0020] Another alternative is to join the gag or env regions or portions thereof to HBsAg gene or pre-S HBsAg gene 
or immunogenic portions thereof, which portion is capable of forming particles in a unicellular microorganism host, e. 
g., yeast or mammalian cells. Thus, particles are formed which will present the HIV-I immunogen to the host in immu- 

30 nogenic form, when the host is vaccinated with assembled particles. 

[0021 ] As vaccines, the various forms of the immunogen can be administered in a variety of ways, orally, parenterally, 
intravenously, intra-arterially, subcutaneously, or the like. Usually, these will be provided in a physiologically acceptable 
vehicle, generally distilled water, phosphate-buffered saline, physiological saline, or the like. Various adjuvants may 
be included, such as aluminum hydroxide, and the dosages, number of times of administration and manner of admin- 

35 istration determined empirically. 

[0022] In order to obtain the HIV-I sequence, virus can be pelleted from the supernatant of an infected host cell. A 
9 kb RNA species is purified by electrophoresis of the viral RNA in low-melting agarose gels, followed by phenol 
extraction. The purified RNA may then be used as a template with random primers in a reverse transcriptase reaction. 
The resulting cDNA is then screened for hybridization to polyA + RNA from infected and uninfected cells. Hybridization 

40 occurring from infected, but not uninfected cells, is related to the HIV-I. 

[0023] Genomic DNA from infected cells can be restriction enzyme digested and used to prepare a bacteriophage 
library. Based upon restriction analysis of the previously obtained fragments of the retrovirus, the viral genome can be 
partially digested with Eco RI and 9 kb-1 5 kb DNA fragments isolated and employed to prepare the library. The resulting 
recombinant phage may be screened using a double-lift screening method employing the viral cDNA probe, followed 

45 by further purification, e.g., plaque-purification and propagation in large liquid cultures. From the library, the complete 
sequence of the virus can be obtained and detected with the previously described probe. 

[0024] HIV-I DNA (either provirus or cDNA) may be cloned in any convenient vector. Constructs can be prepared, 
either circular or linear, where the HIV-I DNA, either the entire hTLR or fragments thereof, may be ligated to a replication 
system functional in a microorganism host, either prokaryotic or eukaryotic cells (mammalian, yeast, arthropod, plant). 

so Micro-organism hosts include E. coli , B. subtilis , P. aerugenosa , S. cerevisiae , N. crassa, etc. Replication systems may 
be derived from ColE1 , 2 mu plasmid, \, SV40, bovine papilloma virus, or the like, that is, both plasmids and viruses. 
Besides the replication system and the HIV-I DNA, the construct will usually also include one or more markers, which 
allow for. selection of transformed or transfected hosts. Markers may include biocide resistance, e.g., resistance to 
antibiotics, heavy metais, etc., complementation in an auxotrophic host to provide prototrophy, and the like. 

55 [0025] For expression, expression vectors will be employed. For expression in microorganisms, the expression vector 
may differ from the cloning vector in having transcriptional and translational initiation and termination regulatory signal 
sequences and may or may not include a replication system which is functional in the expression host. The coding 
sequence is inserted between the initiation and termination regulatory signals so as to be under their regulatory control. 
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Expression vectors may also include the use of regulatable promoters, e.g., temperature-sensitive or inducible by 
chemicals, or genes which will allow for integration and amplification of the vector and HIV-I DNA such as tk, dhfr, 
metallothionein, or the like. 

[0026] The expression vector is introduced into an appropriate host where the regulatory signals are functional in 
5 such host. The expression host is grown in an appropriate nutrient medium, whereby the desired polypeptide is pro- 
duced and isolated from cells or from the medium when the polypeptide is secreted. 

[0027] Where a host is employed in which the HIV-I transcriptional and translational regulatory signals are functional, 
then the HIV-I DNA sequence may be manipulated to provide for expression of the desired polypeptide in proper 
juxtaposition to the regulatory signals. 

w [0028] The polypeptide products can be obtained in substantially pure form, particularly free of debris from human 
cells, which debris may include such contaminants as proteins, polysaccharides, lipids, nucleic acids, viruses, bacteria, 
fungi, etc., and combinations thereof. Generally, the polypeptide products will have less than about 0.1, usually less 
than about 0.01 weight percent, of contaminating materials from the expression host. Depending upon whether the 
desired polypeptide is produced in the cytoplasm or secreted, the manner of isolation will vary. Where the product is 

is in the cytoplasm, the cells are harvested, lysed, the product extracted and purified, using solvent extraction, chroma- 
tography, gel exclusion, electrophoresis, or the like. Where secreted, the desired product will be extracted from the 
nutrient medium and purified in accordance with the methods described above. 

[0029] The expression products of the env and gag genes and immunogenic fragments thereof having immunogenic 
sites may be used for screening antisera from patients' blood to determine whether antibodies are present which bind 

20 to HIV-I antigens. One or more of the recombinant antigens are used in the serological assay. Preferred modes of the 
assay employ a combination of gag, env, and pol antigens. A combination of p25, P31 and env recombinant antigens 
is particularly preferred. A wide variety of immunoassay techniques can be employed, involving labeled or unlabeled 
antigens or immobilized antigens. The label may be fluoresces, radionuclides, enzymes, chemiluminescers, magnetic 
particles, enzyme substrates, cofactors or inhibitors, ligands, or the like. 

25 [0030] A particularly convenient technique is to bind the antigen to a support such as the surface of an assay tube 
or well of an assay plate or a strip of material, such as nitrocellulose or nylon, that binds proteins and contact the 
sample with the immobilized antigen. After washing the support to remove non-specifically bound antisera, labeled 
antibodies to human Ig are added. The support is then washed again to remove unbound labeled anti-human Ig. The 
presence of bound analyte is then determined through detection of the label. - 

30 [0031] ELISA and "dot-blot" assays are particularly useful for screening blood or serum samples for anti-HIV-1 an- 
tibodies. The ELISA assay uses microtiter trays having wells that have been coated with the antigenic HIV-I polypep- 
tides^). The wells are also typically post-coated with a non-antigenic protein to avoid nonspecific binding of antibodies 
. in the sample to the well surface. The sample is deposited in the wells and incubated therein for a suitable period under 
conditions favorable to antigen-antibody binding. 

35 [0032] Anti-HIV-l antibodies present in the sample will bind to the antigen(s) on the well wall. The sample is then 
removed and the wells are washed to remove any residual, unbound sample. A reagent containing enzyme labeled 
antibodies to human immunoglobulin is then deposited in the wells and incubated therein to permit binding between 
the labeled anti-human Ig antibodies and HIV-I antigen-human antibody complexes bound to the well wall. Upon com- 
pletion of the incubation, the reagent is removed and the wells washed to remove unbound labeled reagent. A substyate 

40 reagent is then added to the wells and incubated therein. Enzymatic activity on the substrate is determined visually or 
spectrophotometrically and is an indication of the presence and amount of anti-HIV-l antibody-containing immune 
complex bound to the well surface. 

[0033] The "dot-blot" procedure involves using hTLR antigen(s) immobilized on a piece or strip of bibulous support 
material, such as nitrocellulose filter paper or nylon membrane, rather than antigen-coated microtiter trays. The support 

45 will also be treated subsequently with a non-antigenic protein to eliminate nonspecific binding of antibody to the support. 
The antigen-carrying support is dipped into the sample and allowed to incubate therein. Again, any anti-HIV-l antibodies 
in the sample will bind to the antigen(s) immobilized on the support. After a suitable incubation period the support is 
withdrawn from the sample and dipped repeatedly in wash buffer to remove any unbound sample from the paper. The 
support is then dipped into the enzyme-labeled antibody to human Ig reagent for a suitable incubation period. Following 

so treatment with the labeled reagent the support is dipped in wash buffer, followed by incubation in the substrate solution. 
Enzymatic activity, indicating the presence of anti-HIV-l antibody-containing complexes on the support, causes color 
changes on the support which may be detected optically. 

[0034] Either of these techniques may be modified to employ labels other than enzymes. The reading or detection 
phases will be altered accordingly. 
55 [0035] The antigenic polypeptides of HIV-I may also be used as immunogens by themselves or joined to other 
polypeptides for the production of antisera or monoclonal antibodies which may be used for therapy or diagnosis. The 
immunoglobulins may be from any mammalian source, e.g., rodent, such as rat or mouse, primate, such as baboon, 
monkey or human, or the like. For diagnosis, the antibodies can be used in conventional ways to detect HIV-I in a 
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clinical sample. 

[0036] The HIV-1 DNA sequences may also be labeled with isotopic or non-isotopic labels or markers and be used 
as DNA probes to detect the presence of native HI V-l nucleotide sequences in samples suspected of containing same. 
[0037] The following examples are offered by way of illustration and not by way of limitation. 

5 

1. AIDS related virus-2 (ARV-2) purification and preparation of viral RNA. 

[0038] HUT-78 cells infected with ARV-2 (ATCC Accession No. CRL 8597, deposited on August 7, 1984) were ob- 
tained from Dr. Jay Levy, University of California, San Francisco. Cultures were grown for two weeks in RPMI medium 

10 with 10% fetal calf serum. Cultures were centrifuged at 2 Krpm for 1 hr at 4 ° C using a SW-28 rotor. The pellet, 
containing the virus, was resuspended in 10 mM Tris-HCI, pH 7.5 on ice. The resuspended pellet was treated with 1 0 
ug of DNase (Boehringer-Mannhein) and was layered onto a linear sucrose gradient (15-50% in 10 mM Tris-HCI, pH 
7.5, 1 mM EDTA. 20 mM NaCI). The gradient was spun at 34 Krpm for 4 hr at 4°C, in SW-41 rotor. Five 2.5 ml fractions 
were collected and an aliquot of each was electrophoresed in a 1% agarose, 5 mM methyl mercury hydroxide gel 

15 (Bailey and Davidson, Anal Biochem (1976) 70:75-85) to determine which contained the 9 kb viral RNA. The fraction 
containing the viral RNA was diluted to 1 0 ml in 1 0 mM Tris-HCI, pH 7.5, 1 mM EDTA and was centrifuged at 34 Krpm 
for 2 hr at 4°C. The pellet was resuspended in 20 mM Tris-HCI, pH 7.6, 10 mM EDTA, 0.1% SDS, and 200 u,g/ml 
proteinase K. Incubation was carried out for 15 min at room temperature. The mixture was extracted with phenol and 
the aqueous phase was made 400 mM NaCI and precipitated with ethanol. The pellet was resuspended in water and 

20 stored at -70°C. 

[0039] To purify the viral RNA from the nucleic acid pellet obtained as described above, a sample was electrophoresed 
in a low-melting 1 % agarose gel containing 5 mM Methyl mercury hydroxide. After electrophoresis, the gel was stained 
with 0.1% ethidium bromide and nucleic acid bands were visualized under UV light. The region corresponding to 9 kb 
was cut from the gel and the agarose was melted at 70°C for 2 to 3 min in three volumes of 0.3 M NaCI, 10 mM Tris, 
25 pH 7.5, 1 mM EDTA. The mixture was extracted with an equal volume of phenol. The aqueous phase was reextracted 
with phenol and was precipitated with ethanol. The-pellet was washed with cold 95% ethanol, air dried, resuspended 
in water and stored at -70 °C until use. One hundred ml of culture medium yielded 0.5 to 1 fig of purified RNA. 

2. Synthesis of labeled homologous viral probe. 

30 

[0040] A 32 P-labeled cDNA was made to the gel purified viral RNA using random primers (calf thymus primers) 
prepared as described in Maniatis, et al, A Laboratory Manual, Cold spring Harbor, NY, 1982. The reaction mixture 
contained 2 uJ of 0.5 M MgCI 2 ; 5 u,l of 0.1 M dithiothreitol; 2.5 uJ each of 1 0 mM dATP, 1 0 mM dGTP and 10 mM dTTP; 
2.5 ul calf thymus primer (1 00A 260 /ml); 0.5 ug viral RNA; 5 uJ of actinomycin D (200 ug/ml); 1 0 u.1 of 32 P-dCTP (> 3000 
35 Ci/mmole, 1 mCi/ml) and 1 \i\ of AMV reverse transcriptase (17 units/uJ) in a 50 \i\ reaction volume. The reaction was 
incubated for 1 hr at 37°C. The probe was purified away from free nucleotides by gel filtration using a Sephadex® G50 
column. The void volume was pooled, NaCI was added to a final concentration of 400 mM and carrier single-stranded 
DNA to 100 ug/ml, and the cDNA was precipitated with ethanol. The pellet was resuspended in water and 
incorporated 32 P counts were determined. 

40 

3. Detection of ARV sequences in polyA+ RNA prepared from infected HUT-78 cells. 

[0041] PolyA+ RNA was prepared from HUT-78 cells infected with ARV-2, ARV-3 or ARV-4 (three different isolates 
from three different AIDS patients) and from uninfected HUT-78 cells. The polyA+ RNA was electrophoresed on 1% 
45 agarose gels containing 5 mM methyl mercury hydroxide (Bailey and Davidson, supra), was transferred to nitrocellulose 
filters, and hybridized with the homologous probe prepared as described in Section 2. Hybridizations were carried out 
in 50% formamide, 3 x SSC at 42 °C. Washes were at 50 °C in 0.2 x SSC. A 9 kbp band was present in all three 
samples of infected HUT-78 cells. This band was absent in polyA+ from uninfected cells. 

50 4. Detection of ARV sequences in infected and non-infected HUT-78 cells. 

[0042] High molecular weight DNA (chromosomal) was prepared from cultures of HUT-78 cells infected with ARV-2 
and from non-infected HUT-78 cells following the procedure of Luciw, et al, Molec and Cell Biol - (1984) 4:1260-1269. 
The DNA was digested with restriction enzyme(s), electrophoresed in 1% agarose gels and blotted onto nitrocellulose 
55 following the procedure described by Southern, (1975), supra. Blots were hybridized with the 32 P-labeled probe (10 6 
cpm/blot) in a mixture containing 50% formamide, 3 x SSC, 10 mM Hepes, pH 7.0, 100 uxj/ml denatured carrier DNA, 
100 ng/ml yeast RNA and 1 x Denhardt's for 36 hr at 42°C. Filters were washed once at room temperature in 2 x SSC 
and twice at 42°C in 0.2 x SSC, 0.1% SDS. Filters were air dried and exposed to X-Omat film using an intensifying 
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screen. 

[0043] The homologous 32 P-probe to ARV-2 hybridized specifically to two bands in the DNA from infected cells re- 
stricted with Sack These bands were absent when DNA of non-infected cells was used, indicating that the probe is 
hybridizing specifically to infected cells presumably to the provirus integrated in the chromosomal DNA. The molecular 
5 weight of the bands is approximately 5 kb and 3 kb. 

[0044] In order to determine if different enzymes would cut the proviral sequence, several other restriction digestions 
of the cell DNA were carried out using Eco RI, Sph l or Kpn l or double digestions using two of them. Southern results 
show specific bands hybridizing when DNA of infected cells is used. Figure 1 shows a schematic map of the positions 
of restriction enzyme sites in the proviral sequence, and indicates fragment sites. 

10 

5. Cloning of proviral ARV-2 DNA. 

[0045] High molecular weight cell DNA from infected HUT-78 cells was prepared following the procedure of Luciw, 
et al, supra. The DNA was digested with Eco RI, which cuts once in the provirus, centrifuged in a sucrose gradient and 

is fractions corresponding to 8-1 5 kb were pooled, dialyzed and concentrated by ethanol precipitation. The bacteriophage 
X derivative cloning vector, EMBL-4 (Karn, et al. Methods Enzymol (1 983) 101 :3-1 9) was digested to completion with 
a mixture of Eco RI, Bam HI and Sail restriction enzymes and the DNA then deproteinized by phenol-chloroform extrac- 
tion, precipitated with cold ethanol and resuspended in ligation buffer. The EMBL-4 phage DNA and EcoRI digest of 
cellular DNA were mixed and ligated and the resultant recombinant phage genomes packaged in vitro . After phage 

20 infection of X-sensitive E. coli - (DPSOsupF), about 500,000 phage plaques were transferred onto nitrocellulose filters, 
DNA was fixed and the filters were screened with a homologous 32 P-probe prepared as described in Section 2. Eleven 
recombinant phage out of 500,000 phage annealed in the initial double-lift screening method (Maniatis, et al, Molecular 
Cloning, A Laboratory Manual, NY, 1982) to viral cDNA probe, and these were further plaque-purified and propagated 
in large liquid cultures for preparation of recombinant DNA. Plaque-purified phage containing ARV DNA were propa- 

25 gated in liquid culture in E. coli DP50supF: phage particles were harvested and banded in CsCI gradients and recom- 
binant phage DNA was prepared by phenol extraction followed by ethanol precipitation (Maniatis, et al, supra). One 
ug of purified phage DNA was digested with restriction enzymes, electrophoresed on 1 % agarose gels, and visualized 
with ethidium bromide under ultraviolet light. The DNA from these gels was transferred to nitrocellulose and annealed 
with viral cDNA probe. 

30 [0046] One of the 11 phage, designated X ARV-2(9B), was deposited at the ATCC on 25 January 1985 and given 
Accession No. 401 58. X ARV-2(9B) contained an insertion of full-length proviral DNA along with flanking cell sequences. 
Digestion of X ARV-2(9B) DNA with Sad yielded viral DNA fragments of 3.8 kb and 5.7 kb. Eco RI digestion of X ARV- 
2(9B) produced virus containing DNA species at 6.4 kb and 8.0 kb; a double digest of Sad and EcoRI gave viral DNA 
fragments at 3.8 kb and 5.4 kb. This pattern is consistent with that of a provirus linked to cell DNA. 

35 [0047] In addition to X ARV-2(9B), phage was obtained that (1 ) possessed the left half of the viral genome from the 
EcoRI site in viral DNA extending into flanking cell DNA (XARV-2(8A)) and (2) phage that had the right half of the viral 
genome (XARV-2(7D)) from the EcoRI site in viral DNA extending into flanking cell DNA. Bacteriophages XARV-2(7D) 
(right) and XARV-2(8A) (left) were deposited at the ATCC on October 26. 1984 and given Accession Nos. 40143 and 
40144, respectively. 

40 

6. Polymorphism. 

[0048] To measure the relatedness of independent ARV isolates, restriction enzyme digests of DNA from HUT-78 
cells infected with ARV-3 and ARV-4 were analyzed with the probe made from cloned ARV-2 DNA. The Sac I digest of 

45 ARV-3 DNA was similar to that of ARV-2 whereas the Hindi 1 1 digests displayed different patterns. The Sac l digest and 
the Pstl digest of ARV-4 DNA differed from the corresponding digests of ARV-2 DNA. The intensity of the annealing 
signals obtained with ARV-3 and ARV-4 samples was much lower (about 1 0-fold less) than that for ARV-2 DNA probably 
as a result of the fact that fewer cells were infected in the ARV-3 and ARV-4 cultures. The viral-specific DNA fragments 
produced by Sacl treatment of ARV-3 and ARV-4 DNA totaled 9.0-9.5 kbp, a value similar to that of ARV-2 and in 

50 consonance with the RNA genome sizes. 

7. Sequencing of proviral DNA. 

[0049] Fragments or subfragments of ARV-2 DNA from X phage 9B were prepared and cloned into M13 according 
55 to conventional procedures (Maniatis, et al, supra). Sequencing was performed according to Sanger, et al, Proc Natl 
Acad Sci USA (1977) 74:5463, using the universal M13 primer or chemically synthesized primers complementary to 
ARV-2 sequence. The sequence is shown in Figure 2. 
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8. Amino acid sequence analysis of p25 and p1 6 gag coded proteins. 

[0050] ARV-2 was prepared and purified as described in Section 1 . The viral proteins were electrophoresed on an 
acrylamide gel, and the band corresponding to a 24,000 dalton or 1 6,000 dalton protein was excised from the gel and 
used for sequencing. Micro-sequence analysis was performed using Applied Biosystems model 470A protein sequenc- 
er similar to that described by Hewick, et al, J Biol Chem (1981) 256:7990-7997. Phenylthiohydantoin amino acids 
were identified by HPLC using a Beckman ultrasphere ODS column and a trifluoroacetic acid-acetonitrile buffer system 
as reported by Hawke, et al, Anal Biochem (1 982) 120:302-31 1 . Table 1 shows the first 20 amino acids from the amino 
terminus determined for p25-gag protein and Table 2 shows the first 30 amino acids for p1 6-gag protein. 



TABLE 1 





Amino-terminal sequence of p25-gag 
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TABLE 2 



Amino-terminal sequence of p1 6-gag 


Position 


Amino acid 
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(Met) 
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TABLE 2 (continued) 



Amino-terminal sequence of p16-gag. 


Position 


Amino acid 


15 


- (Cys) 


16 


Phe 


17 
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18 


- (Cys) 
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Gly 
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Lys 
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Glu 


22 


Gly 


23 


(His) 


24 


lie 


25 


Ala 


26 


(Lys) 


27 


Asn 


28 


(Giy) 


29 


(Arg) 


30 


(Ala, Leu) 



[0051] The amino acid sequence of Table 1 is predicted from the ARV-2 DNA sequence of Figure 2. Therefore, these 
25 results confirm that the indicated gag open reading frame is in fact being translated and identifies the N-termini of p25 
and p16. 

9. Expression of p25 gag protein of ARV-2 in bacteria. 

30 A. Host-vector system 

[0052] The p25 gag protein is synthesized by E. coli strain D1210 transformed with plasmid pGAG25-10. 
[0053] Plasmid pG AG25-1 0 is a pBR322 derivative which contains the sequence coding for p25 gag under transcrip- 
tional control of a hybrid tac promoter (De Boer et al, PNAS (1983), 80:21-25) derived from sequences of the trp and 
35 the lac UV5 promoters. Expression of p25 gag is induced in bacterial transformants with isopropylthiogalactoside 
(IPTG). 

[0054] E. coli D1210, a lac-repressor overproducing strain, carries the lacR and lacY+ alleles on the chromosome 
but otherwise is identical to E. coli HB1 01 (P lacl + , lacO + , lacZ + , lacY*, gal", pro*, leu*, thr, end - , hsnr, hsr, recA", rpsL") 
from which it was derived. 

40 

B. Construction of pGAG25-10. 

[0055] Plasmid pGAG25-10 was constructed by cloning a 699 bp DNA fragment coding for p25 gag into plasmid 
ptac5, according to the scheme shown in Figure 3. The vector ptac5 is a pBR322 derivative which contains the tac 
4 $ promoter. Shine Delgarno sequences, and a polylinker as a substitution of the original pBR322 sequences comprised 
between the EcoRI and Pvull restriction sites. 

[0056] The 699 bp DNA fragment codes for the complete p25 gag protein (amino acid residues 139 to 369 as num- 
bered in Figure 2), the only difference being that a methionine was added as the first amino acid in pGAG25-10 to 
allow for translational initiation. This change, as well as other changes in nucleotide sequence as indicated below, was 
so achieved by using chemical synthesis of party of the DNA fragment. The DNA fragment also includes two stop codons 
at the 3' end of the sequence. 

[0057] Figure 4 shows the nucleotide sequence cloned in pGAG25-10 and the amino acid sequence derived from 
it. DNA sequences that are not underlined in the figure were derived directly from the ARV-2(9B) cDNA. All other 
sequences were chemically synthesized or derived from vector ptac5. Changes were introduced in this DNA sequence, 
55 with respect to the original cDNA, to create or delete restriction sites, to add a methionine prior to the proline (first 
residue of p25) or to include stop codons after the last codon of p25 gag. However, as previously indicated, all changes 
in the DNA sequence, except those in the first codon, do not alter the amino acid sequence of p25 gag. 
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C. Preparation of D1210 (pGAG25-10) strain and characterization of p25 gag protein expressed by transformants. 

[0058] E. coli D1 21 0 cells are made competent for transformation following a standard protocol (Cohen et al, PNAS 
(1972) 69:2110). Transformation is performed as indicated in the protocol with 25-50 ng of pGAG25-10. The transfor- 
5 mation mix is plated on agar plates made in L-broth containing 100 u.g/ml ampicillin. Plates are incubated for 12 hr at 
37°C. 

[0059] Single ampicillin resistant colonies are transferred into 1 ml L-broth containing 1 00 ng/ml ampicillin and grown 
at 37°C. Expression of p25 gag protein is induced by adding 1 0 jxl of 1 00 mM IPTG (Sigma) to a final concentration of 
1 mM followed by incubation at 37°C for 2 hr. 
w [0060] Cells from 1 ml of induced cultures are pelleted and resuspended in 100 jxl Laemmli sample buffer. After 3 
cycles of boiling and freezing, portions of resultant lysates are analyzed on standard denaturing acrylamide gels. Pro- 
teins are visualized by staining with Coomassie blue. 

[0061] The extent of expression is initially determined by appearance of new protein bands for induced candidate 
samples compared with control. Proteins of molecular weights expected for the genes expressed comprised 2%-5% 
is of total cell protein in the highest expressing recombinants as determined by visual inspection with reference to a 
standard protein of known amount. 

[0062] Authenticity of the expressed proteins is determined by standard Western transfer of proteins to nitrocellulose 
and analysis with appropriate human or rabbit immune sera or mouse monoclonal antibodies (see E.4.a. below) or by 
ELISA assays of soluble E. coli proteins using human immune sera from AIDS patients (see E.4.b. below). 

20 

D. Fermentation process. 

D.1 . Preparation of transformant master seed stock. 

25 [0063] Transformant cells from a culture expressing high levels (3%) of p25 gag are streaked onto an L-broth plate 
containing 100 jig/ml ampicillin and the plate is incubated overnight at 37°C. A single colony is inoculated into 10 ml 
of L-broth, 1 00 fig/ml ampicillin and grown overnight at 37°C. An aliquot is used to verify plasmid structure by restriction 
mapping with Sail and Pstl. A second aliquot is used to induce expression of p25 gag and the rest of the culture is 
made 15% glycerol by adding 1/4 volume of 75% sterile glycerol. Glycerol cell stocks are aliquoted in 1 ml and quickly 

30 frozen in liquid nitrogen or dry-ice ethanol bath. These master seed stocks are stored at -70 ° C. 

D.2. Master plate/single colonies and overnight cultures. 

[0064] The master seed stock is scraped with a sterile applicator which is used to streak an L-broth plate containing 
35 100 \LQ/m\ ampicillin. Single colonies from this plate are used to inoculate 20-50 ml of L-b roth/amp, which is incubated 
at 37°C overnight. 

D.3. Fermentor inoculum. 

40 [0065] An aliquot of the overnight culture is used to inoculate larger volumes (1-6 liters) of L-broth/amp. Cells are 
incubated at 37°C overnight and reach an O.O.^q of approximately 5 prior to use as inoculum for the fermenter run. 

D. 4. Fermentation and harvest. 

45 [0066] Fermenters (capacity: 16 liters) containing 10 I of L-broth and 1 ml of antifoam are inoculated with 100-500 
ml from the inoculum culture. Cells are grown at 37°C to an O.D. of about 1. Expression of p25 gag is induced by 
addition of 100 ml of an IPTG solution (100 mM) to yield a 1 mM final concentration in the fermenter. Cells are grown 
for 3 additional hours and subsequently harvested using continuous flow centrifugation. At this step cells may be frozen 
and kept at -20°C until purification of p25 gag proceeds. Alternatively, 250 I fermenters are inoculated with 1-5 I from 

so the inoculum culture. Growth, induction, and harvest are as indicated before. 

E. Purification and characterization of p25 gag. 
E.1. Cell breakage. 

55 

[0067] Frozen E. coli cells are thawed and suspended in 2.5 volumes of lysis buffer (0.1 M sodium phosphate (NaPi). 
pH 7.5. 1 mM EDTA. 0.1 M NaCI). Cells are broken in a non-continuous system using a 300 ml glass unit of a Dyno 
Mill at 3000 rpm and 140 ml of acid-washed glass beads for 15 min. The jacketed chamber is kept cool by a -20°C 



9 



EP0181 150 B2 



ethylene glycol solution. Broken cells are centrifuged at 27.000 x g for 25 minutes to remove debris and glass beads. 
The supernatant is recovered and kept at 4°C. 

E.2. Selective protein precipitation. 

[0068] The cell extract is made 30% (NH 4 ) 2 S0 4 by slowly adding the ammonium sulfate at 4°C. The extract is stirred 
for 10 min after the final concentration is achieved, followed by contrifugation at 27.000 x g for 20 min. The pellet is 
resuspended in 1 M NaCI, 1 mM EDTA. 1% Triton® X-100, and 5% SDS, and then boiled for 5 min. 

10 E.3. Gel filtration. 

[0069] The fraction obtained by selective precipitation is submitted to gel filtration using a G50 Sephadex column 
equilibrated in 0.03 M NaPi, pH 6.8. Chromatography is developed in the same solution. Fractions are collected and 
absorbance at 280 nm is determined. Protein-containing fractions are pooled and characterized by protein gel elec- 
ts trophoresis, Western analysis, and ELISA. 

E.4. Characterization of recombinant p25 gag. 

[0070] 

20 

a. Protein gel electrophoresis. SDS-polyacrylamide gel analysis (10%-20% gradient gels) of proteins from 
pGAG25-containing cells and control cells indicated that varying levels of a protein of a molecular weight of about 
25,000 were specifically induced in cells containing p25 gag expression plasmids after derepression of the tacl 
promoter with IPTG. Identity of the p25 gag gene product was confirmed by both an enzyme-linked immunosorbent 

25 assay (ELISA, see E.4.c.) and Western immunoblot analysis (see E.4.b.) using both AIDS patient serum and a 

monoclonal antibody to viral p25 gag. 

b. Western analysis. Samples were electrophoresed under denaturing conditions on a 10%-20% polyacrylamide 
gradient gel. Samples were electro blotted onto nitrocellulose. The nitrocellulose paper was washed with a 1 :250 
dilution of AIDS patient reference serum (EW5111 , obtained from P. Feorino, Centers for Disease control, Atlanta, 

30 Georgia) and then with a 1 :500 dilution of HRP-conjugated goat antiserum to human immunoglobulin (Cappel, No. 

3201-0081). Alternatively, the nitrocellulose was washed with undiluted culture supernatant from 76C, a murine 
monoclonal antibody to ARV-2 p25 gag, and then with a 1 :500 dilution of HRP-conjugated goat antiserum to mouse 
immunoglobulin (TAGO, No. 6450). The substrate for immunoblots was HRP color development reagent containing 
4-chloro-1-naphthol. 

35 The p25 gag protein reacted with both AIDS patient reference serum and with the monoclonal antibody, while 

it shows no reactivity with the non-immune serum. 

c. ELISA. p25 gag was purified from bacterial extracts as previously described. The reactivity of sera with the 
purified protein was assayed by coating wells of microtiter plates with 0.25 u.g/ml, adding dilutions of test sera 
(positive reference serum EW5111 of human negative serum), followed by a 1:1000 dilution of HRP-conjugated 

40 goat antiserum to human immunoglobulin. p25 gag protein reacted with the positive serum with a midpoint of 

titration curve of approximately 1 :800. There was no reactivity with serum from a normal individual. 

10. Comparison of recombinant p25 gag protein and natural p25 gag protein in ELISA. 

45 [0071] The reactivity of purified recombinant p25 gag to various sera was compared to that of natural p25 gag protein 
purified by preparative polyacrylamide gel electrophoresis in an ELISA assay. For control, assays were also made 
using disrupted gradient purified virus (5 u.g/ml). 

[0072] PVC microtiter plates were incubated for 2 hr at 37°C with 10 ng/ml (50 u.l/well in 0.1 M sodium borate, pH 
9.0) of the Ig fraction of ascites from murine anti-p25 gag monoclonal antibody 76C. The plates were washed with PBS 

so and the wells were filled with 1 0% normal goat serum in PBS. Following a 30 min incubation at room temperature, the 
plates were washed with normal saline containing 0.05% Triton® X-1 00 (ST) and dilutions of the test ARV protein (50 
ul/well in ST with 1 0% goat serum [STGS]) were added to the wells. The plates were incubated for 2 hr at 37°C, washed 
with ST, and then incubated for 1 hr at 37. C with 50 u.l/well of rabbit antiserum raised against disrupted ARV (1:1000 
dilution in STGS). The wells were washed, incubated for 1 hr with 50 u.l of a 1:1 500 dilution in STGS of HRP-conjugated 

55 goat antiserum to rabbit immunoglobulin, washed, and then the wells received 50 uJ/well of substrate solution (1 50 u.g/ 
ml 2,2'-azino-di-[3-ethyibenzthiazolene sulfonic acid], 0.001% H 2 0 2 , 0.1 M citrate pH 4). The reaction was stopped 
after incubation for 30 min at 37 °C by the addition of 50 u.l/well of 10% SDS. The absorbance was read on a Flow 
Titertech ELISA reader at 414 nm. Samples were assayed in duplicate beginning at a dilution of 1:10 and by serial 
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2-fold dilutions thereafter. 

[0073] The table below summarizes the results of assays on 8 AIDS sera that scored positive in the assay with 
disrupted virus and 6 normal sera that were negative in the disrupted virus assay. 



SERUM NUMBER 


ELISA ASSAY TITER 3 




Disrupted Virus 


Recomb. p25 gag 


Viral p25 gag 


Group I: Sera Scoring As Positive in Virus ELISA b 


1 


51,200 


3,125 


3,125 


5 


12,800 


25 


25 


6 


12,800 


625 


625 


7 


12,800 


3,125 


3,125 


8 


25,600 


15,625 


15,625 


9 


12,800 


625 


625 


13 


800 


125 


125 


18 


3,200 


625 


625 


Groupll: Sera Scoring Negative in Virus ELISA b 


15 


_c 






16 








19 








21 








26 








33 









a. Reported as the reciprocal of the serum dilution that gave a signal equivalent to 50% of the maximum. 

b. Results were confirmed by immunofluorescence and immunoblotting as described previously. 

c. No detectable signal at a 1 :25 serum dilution. 

30 

[0074] These results show that p25 gag purified from bacteria behaves identically to similarly purified p25 gag from 
AIDS virus in an ELISA of the eight AIDS patient sera. The results of the ELISA show that there is a wide variation in 
the levels of anti-p25 gag antibodies and suggests that antibodies to some virus-encoded proteins may not be detected 
using conventional virus-based assay systems. 

35 

11 . Expression of p41 gag protein of ARV-2 in bacteria. 

[0075] A fusion protein of the p25 gag and p1 6 gag proteins of ARV-2, designated p41 gag, was synthesized in E. 

coli strain D1210 transformed with plasmid pGAG41-10. pGAG41-10 was constructed from plasmid pGAG25-10 as 
40 shown in Figure 3 by inserting an Sphl-Hpal fragment from the ARV-2 genome containing the sequences from the C- 

terminal p16 gag portion of the p53 gag precursor polyprotein and part of the p25 gag protein between the Sphl and 

Bam HI sites of pGAG25-10. The coding strand of the DNA sequence cloned in pGAG41-10 is shown in Figure 5. 

Transformation and induction of expression were effected by the procedures described above. The cells were treated 

and the p41 gag protein was visualized on Coomassie-stained gel as described above. The approximate molecular 
45 weight of the observed protein was 41 ,000 daltons. The protein reacted with AIDS sera and monoclonal antibody to 

p25 gag in Western and ELISA analyses carried out as above. 

12. Expression of p16 gag protein of ARV-2 in bacteria. 

so [0076] The sequence shown in Figure 6 and coding for the p1 6 gag protein was chemically synthesized using yeast- 
preferred codons. The blunt-end Sail fragment (381 bp) was cloned into Pvull-Sall digested and gel-isolated ptac5 
(see 9 and 11 above). The resulting plasmid was used to transform D1 210 cells, as in 9 above. Expression was induced 
with IPTG, and proteins were analyzed by polyacrylamide gel electrophoresis and Western analysis. A band of about 
16,000 daltons was induced by IPTG in the transformed cells. This protein showed reactivity in Western blots with 

55 immune sera from AIDS patients. No reactivity was observed with sera from normal individuals. 
[0077] A recombinant gag protein was also expressed in Cos (mammalian) cells. 
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13. Production of ARV-2 env protein by yeast. 

A. Host-vector system. 

s [0078] A partial env protein is synthesized by S. cerevisiae 21 50-2-3 transformed with plasmid pDPC303. Plasmid 
pDPC303 is a yeast expression vector which contains the sequence coding for 2/3 of the env protein as well as pBR322 
sequences including the amp R gene and 2-micron sequences including the yeast leu 2-04 gene. Expression of env is 
under regulation of the yeast pyruvate kinase promoter and terminator sequences. Yeast strain S. cerevisiae 21 50-2-3 
has the following genotype: Mat a, ade 1 , leu 2-112, cir°. This strain was obtained from Dr. Leland Hartwell, University 

10 of Washington. 

B. Construction of pDPC303, a yeast expression vector for env protein. 

[0079] Plasmid pDPC303 contains an "expression cassette 0 (described below) for env cloned into the Bam HI site of 
15 vector pCI/l. Vector pCI/l contains pBR322 and 2 micron sequences including the amp R and yeast leu 2-04 markers. 
It was derived from pJDB219d (Beggs, Nature (1 978), 275: 104) by replacing the pMB9 region with pBR322 sequences. 
[0080] The "expression cassette" for env consists of the following sequences fused together in this order (5' to 3'): 
yeast pyruvate kinase (PYK) promoter, env cDNA, and PYK terminator. The PYK promoter and terminator regions 
were derived from PYK cDNA isolated as described in Burke, et al, J Biol Chem (1 983) 258:21 93-2201 . 
20 [0081] The env fragment cloned into the expression cassette was derived from ARV-2 cDNA and comprises a 1395 
bp cDNA fragment which codes for env amino acid residues coded by nt 5857 to nt 7251 (Figure 2), In addition, there 
are 5 extra codons fused in reading frame in the 5' end, the first codon corresponding to a methionine, and 4 extra 
codons fused in reading frame at the 3' end followed by a stop codon. The extra codons were incorporated to facilitate 
cloning procedures exclusively. 
25 [0082] Figure 7 shows the coding strand of the nucleotide sequence cloned in pDPC303 and the amino acid sequence 
derived from it. DNA sequences that are not underlined in the figure were derived directly from the ARV-2 (9B) cDNA 
described above. All other sequences were either chemically synthesized or derived from the PYK vector. 

C. Preparation of 2150 (pDPC303) strain 

30 

[0083] Yeast cells S. cerevisiae 21 50-2-3 (Mat a , ade 1 , leu 2-04, cir°) were transformed as described by Hinnen et 
al (PNAS (1 978) 75:1 929-1 933) and plated onto leu- selective plates. Single colonies were inoculated into leu- selective 
media and grown to saturation. Cells were harvested and the env protein was purified and characterized as described 
below. 

35 

D. Purification and characterization of env protein. 
D.1. Cell breakage. 

40 [0084] Frozen S. cerevisiae 2150-2-3 (pDPC303) are thawed and suspended in 1 volume of lysis buffer (1 jig/ml 
pepstatin, 0.001 M PMSF, 0.001 M EDTA, 0.15 M NaCI, 0.05 M Tris-HCI pH 8.0), and 1 volume of acid-washed glass 
beads are added. Cells are broken in a non-continuous system using a 300 ml glass unit of Dyno Mill at 3000 rpm for 
10 min. The jacket is kept cool by a -20°C ethylene glycol solution. Glass beads are decanted by letting the mixture 
set for 3 minutes on ice. The cell extract is recovered and centrifuged at 18,000 rpm (39,200 x g) for 35 min. The 

45 supernatant is discarded and the precipitate (pellet 1 ) is further treated as indicated below. 

D.2. SDS extraction of insoluble material. 

[0085] Pellet 1 is resuspended in 4 volumes of Tris-HCI buffer (0.01 M Tris-HCI, pH 8.0, 0.01 M NaCI, 0.001 M PMSF, 
so 1 ug/ml pepstatin, 0.001 M EDTA, 0.1 % SDS) and extracted for 2 hr at 4°C with agitation. The solution is centrifuged 
at 6,300 x g for 15 min. The insoluble fraction (pellet 2) is resuspended in 4 volumes (360 ml) of PBS (per liter: 0.2 g 
KCI, 0.2 g KH 2 P0 4 , 8.0 g Nad, 2.9 g Na 2 HP0 4 .12H 2 0), 0.1% SDS, 0.001 M EDTA, 0.001 M PMSF, 1 ng/ml pepstatin, 
and centrifuged at 6,300 x g for 15 min. The pellet (pellet 3) is suspended in 4 volumes of PBS, 0.2% SDS, 0.001 M 
EDTA, 0.001 M PMSF, 1 ng/ml pepstatin and is extracted for 12 hr at 4°C with agitation on a tube rocker. The solution 
55 is centrifuged at 6,300 x g for 1 5 min. The soluble fraction is recovered for further purification as indicated below. (The 
pellet can be reextracted by resuspending it in 4 volumes of 2.3% SDS, 5% (5-mercaptoethanol, and boiling for 5 min. 
After boiling, the solution is centrifuged at 6,300 x g for 1 5 min. The soluble fraction is recovered for further purification.) 
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D.3. Selective precipitation and gel filtration. 

[0086] The soluble fraction is concentrated by precipitation with 30% ammonium sulfate at 4°C. The pellet (pellet 4) 
is resuspended in 2.3% SDS, 5% p-mercaptoethanol, and chromatographed on an ACA 34 (LKB Products) gel filtration 
5 column. The column is equilibrated with PBS, 0.1% SDS, at room temperature. Chromatography is developed in the 
same solution with a flow rate of 0.3 ml/min. Five ml fractions are collected, pooled and characterized by protein gel 
electrophoresis, Western analysis, and ELISA. If needed, pooled fractions are concentrated by vacuum dialysis on 
Spectrapor #2 (MW cutoff 12-14K). 

*0 D.4. Characterization of recombinant env. 

[0087] SDS polyacrylamide gel analysis (12% acrylamide gels) showed that a new 55,000 dalton protein was being 
synthesized in yeast cells transformed with the env-containing vector. The 55,000 datton protein is absent from cells 
transformed with control plasmid (vector without env insert). The identity of env was confirmed by both ELISA (see 
is 9.E.4.C) and Western analysis using AIDS patient serum. In both assays the 55,000 dalton protein showed immuno- 
reactivity. No reactivity was obtained with serum from a normal individual. 
[0088] Recombinant env was also expressed in mammalian (Cos) cells. 

14. ELISA for antibodies to HIV-I using recombinant ARV-2 polypeptides 

20 

[0089] Stock solutions of purified p25 gag protein (1.25 mg/ml in 20 mM sodium phosphate, 0.1% SDS, pH 7.2), 
purified env protein (2 mg/ml in 20 mM sodium phosphate, 0.1% SDS, pH 7.2), and purified SOD-p31 fusion protein 
(2 mg/ml in 20 mM sodium phosphate, 0.1% SDS, pH 7.2) were prepared. 

[0090] For coating microtiter plates (Dynatech Immulon I), 1 part each of the stock solutions of p25 gag, env, and 
25 SOD-p31 were added to 997 parts of borate coating buffer (0.05 M borate, pH 9.0). One hundred microliters of the 
coating solution was added to each well, and the plates were covered and incubated 2 hr at 37°C or 12 hr at 4°C. The 
coating solution was then aspirated from the wells and the plates washed 6 x with wash solution (0.1 37 M 0.8% NaCI, 
0.05% Triton® X-1 00). 

[0091] Serum samples were diluted 1:100 in dilution solution (0.1% casein, 1 mM EDTA, 1% Triton® X-1 00, 0.5 M 
30 NaCI, 0.01 % thimerosal, pH 7.5) with yeast protein (strain AB1 03.1 ) extract (1 :40 dilution, approximately 2 mg protein 
per ml in PBS containing 1 % Triton X-1 00, 2 mM PMSF, 0.01 % thimerosal) and E. coli protein extract (1 :40 dilution, 
approximately 1 mg protein per ml in PBS containing 1% Triton® X-1 00, 2 mM PMSF, 0.01% thimerosal) added to the 
dilution solution. Extraction procedures were similar to those described in 1 3 and 1 4 above but using non-recombinant 
strains. One hundred microliters of diluted serum was added to each well and incubated 30 min at 37°C. The plates 
35 were then washed 6 x with wash solution. 

[0092] Goat anti-human Ig labeled with horseradish peroxidase (Cappel) diluted 1 :8000 in dilution solution without 
added yeast and E. coli extracts were added at 100 u.l/well to the plates and incubated 30 min at 37°C. The plates 
were then washed 6 x with wash solution. Substrate solution (10 ml citrate buffer, 10.5 g citric acid/liter dH 2 0, pH to 
4.0 with 6 M NaOH), 0.1 ml ABTS (15 mg/ml 2,2'-azino-di-(3-ethyl-benzrthiazolene sulfonic acid) in dH 2 0) and 3.33 
40 uJ H 2 0 2 ) at 1 00 u.l/well was then added to the plates and the plates wrapped in foil and incubated at 37°C for 30 min. 
The reaction was then stopped by adding 50 uVwell of 10% SDS. Readings were made with a Dynatech ELISA reader 
set for dual wavelength reading: absorbance wavelength of 1 (410 nm) and reference wavelength of 4. 

Results 

45 

[0093] The following sera were tested: 

A. 89 consecutive blood donors from the Kansas City Blood Bank ("normal blood donors"): log nos. 1001-1081 , 
1085-1092. 

50 B. 52 sera from patients with lymphadenopathy syndrome (LAD) or AIDS or sexual partners of persons with LAD 

or AIDS (referred to as "contacts" )-alI obtained from UCSF AIDS Serum Bank panel: log nos. 4601-4652. 

[0094] The positive/negative cut-off used was 5 x (average background signal - signal with diluent alone) and was 
determined to be 0.195. Thus, sera with signals below 0.195 were rated (-); those above were rated (+). Each sample 
55 was also evaluated by the commercially available ABBOTT HTLV III EIA kit (Abbott Labs) and by Western analysis. 
[0095] Tests on the normal blood donor samples indicated all except one were negative in the invention ELISA. This 
normal serum scored negative in the ABBOTT HTLV 111 EIA test, but was actually positive, as confirmed by Western 
analysis. 
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The results of the tests on the 52 sera from LAD and AIDS patients and contacts are tabulated below: 



5 



10 



15 



25 



35 



45 



50 



55 



Serum No. 


Diagnosis 


ABBOTT EIA 


Invention ELISA 


Western 


4601 


Contacts 


+ 


1.89 




+ 


02 


Contacts 


- 


0.04 




- 


03 


Contacts 


+ 


1.44 




+ 


04 


Contacts 


+ 


1.92 




+ 


05 


Contacts 


- 


0.04 




- 


06 


Contacts 


+ 


>2 




+ 


07 


Contacts 


+ 


1.37 




+ 


08 


Contacts 


+ 


1 .60 


- + 


+ 


09 


Contacts 


+ 


>2 




+ 


10 


Contacts 


+ 


>2 




+ 


11 


Contacts 


+ 


1 Q4 


+ 


+ 


12 


Contacts 


+ 


>2 


+ 


+ 


13 


Contacts 


+ 


>2 


+ 


+ 


14 


Contacts 


+ 






+ 


15 


Contacts 


+ 


1 Q7 




+ 


16 


AIDS 


+ 


o fii 


+ 


+ 


17 


AIDS 


+ 




+ 


+ 


18 


AIDS 


+ 




+ 


+ 


19 


AIDS 


+ 


1 

1 .JO 


+ 


+ 


20 


AIDS 


+ 


1 .JO 




+ 


21 


AIDS 


+ 


n 7fi 


+ 


+ 


22 


AIDS 


+ 


1 74 


+ 


+ 


23 


, LAD 


+ 


1 Pfi 

1 .CX) 




+ 


24 


LAD 


+ 




+ 


+ 


25 


AIDS 


+ 


1 04 


+ 


+ 


26 


AIDS 


+ 


1 94 


+ 


+ 


27 


AIDS 


+ 


1 40 


+ 


+ 


28 


AIDS 


- 


n 07 




- 


29 


LAD 


+ 


1 


+ 


+ 


30 


Contacts 


+ 


1 .570 


+ 


+ 


31 


AIDS 


+ 


1 7fi 


+ 


+ 


32 


AIDS 


+ 


n on 


+ 


+ 


33 


AIDS 


+ 


1 fiQ 

1 .vJJJ 




+ 


34 


LAD 


+ 


1 0Q 


+ 


+ 


35 


AIDS 


+ 


1 ^4 




+ 


36 


AIDS 


+ 




+ 


+ 


37 


AIDS 


+ 


1 Qfi 

1 .570 


+ 


+ 


38 


AIDS 


- 




+ 


+ 


39 


LAD 


+ 


1 ft*; 

1 .OJ 


+ 


+ 


40 


LAD 


+ 






+ 


41 


LAD 


+ 


0.84 


+ 


+ 


42 


LAD 


+ 


1.59 


+ 


+ 


43 


LAD 


+ 


1.71 


+ 


+ 


44 


AIDS 


+ 


1.40 


+ 


+ 


45 


LAD 


+ 


>2 


+ 


+ 


46 


AIDS 


+ 


1.38 


+ 


+ 


47 


AIDS 


+ 


1.29 


+ 


+ 


48 


LAD 


+ 


1.93 


+ 


+ 


49 


LAD 


+/- 


0.48 


+ 


+ 
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(continued) 



5 



Serum No. 


Diagnosis 


ABBOTT EIA 


Invention ELISA 


Western 


50 


LAD 




0.04 






51 


LAD 




0.07 






52 


LAD 


+ 


1.92 


+ 


+ 



[0097] The above results show that the invention ELISA using recombinant ARV proteins is at least as good as the 

10 ABBOTT HTLV III EIA test or Western analysis. 

[0098] In the invention ELISA reported in this example the yeast and bacterial extracts were added to the serum to 
bind serum antibodies to yeast and bacteria to prevent such antibodies from binding to the recombinant ARV-2 proteins. 
Both yeast and bacterial extracts were required since the recombinant polypeptides included polypeptides expressed 
in yeast and polypeptides expressed in bacteria. If all the polypeptides were expressed in the same type of organism, 

15 only one extract would be needed. For instance, if a p25 gag polypeptide expressed in yeast was substituted for the 
bacterially produced p25 gag polypeptide of the example, only yeast extract would be added to the serum samples. 

15. Dot-blot assay for antibodies to HIV-I using recombinant ARV-2 polypeptides. 

20 [0099] Nitrocellular strips (0.5 x 5 cm) are spotted with 50 ng polypeptide in PBS (spotting volume 2 jil): After spotting 
the strips are dried at room temperature for 1 hr or more. The strips are then post-coated in a 5% solution of Carnation 
non-fat dry milk in PBS, 0.01% Thimerosal, for 1 5-60 min at room temperature. Each test solution sample is diluted 1 : 
50 in 0.5 ml of the post-coating solution in a test tube. A post-coated strip is then placed in the tube and incubated in 
the sample with rocking at 37°C for 1 hr. The strip is then removed from the tube and washed with post-coating solution. 

25 The strip is then incubated for 1 5 min at room temperature in goat anti-human Ig reagent labeled with horse radish 
peroxidase diluted 1:500 in post-coating solution. After incubation in the labeled antibody, the strip is washed serially 
with PBS, 1% Triton, and distilled water. The strips are developed by incubating them in substrate solution (see 23 
above) for 15 min at room temperature. 

[0100] Positive samples will cause a visually perceptible color change at the spotting site. Normal (negative) sera 
30 sample yield no color change or give a faint signal that is discernible from a positive signal. Competition assays may 
be run on sera giving faint signals to verify that they are negative. In the competition assay, polypeptide (1 0-25 ng/ml) 
is added to the test sample and incubated from 1 hr at 37°C before the strip is incubated in the sample. With authentic 
positive sera the signal is completely blocked by the added polypeptide, whereas with normal (negative) sera there is 
no change in signal. 

35 [0101] Samples of organisms that express the above-described ARV-2 p25 gag and ARV-2 env polypeptides were 
deposited at the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Maryland under the 
provisions of the Budapest Treaty. The accession numbers and dates of these deposits are listed below. 
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Expression Product 


ATCC Accession No. 


Deposit Date 


ARV-2 p25 gag 
ARV-2 env 


53246 
20769 


27 August 1985 
27 August 1985 



Claims 

45 

1. A recombinant DNA construct useful for the expression of a recombinant polypeptide in a cell containing the con- 
struct, the construct comprising control sequences which regulate transcription and translation of the recombinant 
polypeptide in the cell and a coding sequence regulated by the control sequences, wherein the coding sequence 
comprises a DNA sequence of at least about 21 bp in reading frame characterised in that the DNA sequence 

50 encodes an antigenic HIV-I gag or env amino acid sequence of Figure 2 which sequence is immunologically non- 
cross-reactive with HTLV-I and HTLV-II and is reactive with HIV-I. 

2. A recombinant DNA construct according to claim 1 , which is useful for expression in a eukaryotic cell. 
55 3. A recombinant DNA construct according to claim 1 , which is useful for expression in a yeast cell. 

4. A recombinant DNA construct according to claim 1 , which is useful for expression in a bacterial cell. 
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5. A recombinant DNA construct according to any one of claims 1 to 4, characterised in that the DNA sequence 
encodes an amino acid sequence from an env polypeptide of HIV-I. 

6. A recombinant DNA construct according to claim 5, wherein the DNA sequence encodes a complete env polypep- 
5 tide. 

7. A recombinant DNA construct according to any one of claims 1 to 4, characterised in that the DNA sequence - 
encodes an amino acid sequence from a gag polypeptide of HIV-I. 

io 8. A recombinant DNA construct according to claim 7, wherein the DNA sequence encodes a complete gag polypep- 
tide. 

9. A cell comprising a recombinant DNA construct according to any one of claims 1 to 8, wherein the cell expresses 
the antigenic HIV-I amino acid sequence and is free from other cells which do not express the antigenic HIV-I 

15 amino acid sequence. 

10. A cell according to claim 9, wherein the recombinant DNA construct comprises a replication system recognised 
by the cell. 

20 1 1 . a cell according to claim 1 0, wherein the cell is eukaryotic. 

12. A cell according to any one of claims 9 to 11 , which is a yeast. 

13. A method of producing a recombinant polypeptide comprising an antigenic HIV-I amino acid sequence wherein a 
25 population of cells according to claim 9 is cultured under conditions whereby the recombinant polypeptide is ex- 
pressed. 

14. A method according to claim 1 3, wherein the cells are eukaryotic. 

30 15. A method according to claim 1 3, wherein the cells are yeast or bacteria. 

16. An immunoassay for detecting antibodies to HIV-I in a sample suspected of containing the antibodies, character- 
ised in that at least one recombinant polypeptide is used to bind the antibodies and the recombinant polypeptide 
comprises an antigenic env or gag HIV-I amino acid sequence contained in the sequence shown in Figure 2, which 

35 polypeptide is immunologically non-cross-reactive with HTLV-I and HTLV-II. 

17. An immunoassay according to claim 16, wherein at least one env amino acid sequence and one gag amino acid 
sequence are used to bind the antibodies. 

40 18. A diagnostic reagent or immunogen capable of binding an anti-HIV-l antibody in human serum characterised in 
that said reagent or immunogen consists of an antigen comprising an immunogenic fragment of at least seven 
amino acids of an HIV-I env or gag polypeptide, which fragment is immunologically non-cross-reactive with HTLV-I 
and HTLV-II and which has a sequence contained in the sequence shown in Figure 2. 

45 19. A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant DNA 
construct according to claim 5. 

20. A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant DNA 
construct according to claim 6. 

50 

21. A recombinant polypeptide characterised in that it is produced by a cell transformed by a recombinant DNA 
construct according to claim 7. 

22. A recombinant polypeptide according to claim 21 , wherein the gag amino acid sequence comprises p16.gag. 

55 

23. A recombinant polypeptide according to claim 21 , wherein the gag amino acid sequence comprises p25 gag. 

24. A recombinant polypeptide according to claim 21 , wherein the gag amino acid sequence comprises a fusion protein 
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of p1 6 gag and'p25 gag amino acid sequences. 

25. An article of manufacture for use in an immunoassay for HIV-I antibodies characterised in that it comprises a 
solid support having bound thereto a recombinant polypeptide according to claim 1 9. 

5 

26. An article of manufacture for use in an immunoassay for HIV-I antibodies characterised in that it comprises a 
solid support having bound thereto a recombinant polypeptide according to claim 21 . 

27. A DNA sequence encoding an HIV-I polypeptide derived from a phage selected from ARV-2 (7D) (ATCC No. 401 43) 
10 and ARV-2 (8A) (ATCC No. 40144). 

28. A recombinant DNA construct capable of expressing an antigenic recombinant HIV-I polypeptide derived from 
organism ATCC No. 53246. 

'5 29. An isolated polynucleotide comprising a fragment of at least 2 1 bp from the gag or env region of the ARV-2 sequence 
of Figure 2, wherein said polynucleotide is not greater than 180 bp. 



Patentanspruche 

20 

1. Rekombinantes DNA-Konstrukt zur Expression eines rekombinanten Polypeptids in einer das Konstrukt enthai- 
tenden Zelle, wobei das Konstrukt Kontrollsequenzen umfasst, die die Transkription und die Translation des re- 
kombinanten Polypeptids in der Zelle regulieren und eine von den Kontrollsequenzen regulierte codierende Se- 
quenz, wobei die codierende Sequenz eine DNA-Sequenz von mindestens etwa 21 bp im Leserahmen umfasst, 

25 dadurch gekennzeichnet, dass die DNA-Sequenz eine antigene HIV-lgag- Oder -env-Aminosauresequenz von 

Figur 2 codiert, wobei die Sequenz immunologisch nicht kreuzreaktiv mit HTLV-I und HTLV-II und mit HIV-I reaktiv 
ist. 

2. Rekombinantes DNA-Konstrukt nach Anspruch 1 zur Expression in einer eukaryontischen Zelle. 

30 

3. Rekombinantes DNA-Konstrukt nach Anspruch 1 zur Expression in einer Hefezelle. 

4. Rekombinantes DNA-Konstrukt nach Anspruch 1 zur Expression in einer bakteriellen Zelle. 

35 5. Rekombinantes DNA-Konstrukt nach einem der Anspruche 1 bis 4, dadurch gekennzeichnet, dass die DNA-Se- 
quenz eine Aminosauresequenz eines env-Polypeptids von HIV-I codiert. 

6. Rekombinantes DNA-Konstrukt nach Anspruch 5, wobei die DNA-Sequenz ein komplettes env-Polypeptid codiert. 

40 7. Rekombinantes DNA-Konstrukt nach einem der Anspruche 1 bis 4, dadurch gekennzeichnet, dass die DNA-Se- 
quenz eine Aminosauresequenz eines gag-Polypeptids von HIV-I codiert. 

8. Rekombinantes DNA-Konstrukt nach Anspruch 7, wobei die DNA-Sequenz ein komplettes gag-Polypeptid codiert. 

45 9. Zelle, die ein rekombinantes DNA-Konstrukt nach einem der Anspruche 1 bis 8 umfasst, wobei die Zelle die anti- 
gene HIV-I -Aminosauresequenz exprimiert und frei von anderen Zellen ist, die die antigene HIV-l-Aminosaurese- 
quenz nicht exprimieren. 

10. Zelle nach Anspruch 9, wobei das rekombinante DNA-Konstrukt ein von der Zelle erkanntes Replikationssystem 
so umfasst. 

11. Zelle nach Anspruch 10, wobei die Zelle eukaryontisch ist. 

12. Zelle nach einem der Anspruche 9 bis 11 , wobei die Zelle eine Hefezelle ist. 

55 

13. Verfahren zur Herstellung eines rekombinanten Polypeptids, das eine antigene HIV-l-Aminosauresequenz um- 
fasst, wobei eine Population von Zellen nach Anspruch 9 unter Bedingungen gezuchtet wird, unter denen das 
rekombinante Polypeptid exprimiert wird. 
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14. Verfahren nach Anspruch 13, wobei die Zellen eukaryontisch sind. 

15. Verfahren nach Anspruch 13, wobei die Zellen Hefe- Oder Bakterienzellen sind. 

16. Immunassay zum Nachweis von Antikorpern gegen HIV-I in einer Probe, von der angenommen wird, dass sie die 
Antikorper enthalt, dadurch gekennzeichnet, dass mindestens ein rekombinantes Polypeptid zur Bindung der 
Antikorper verwendet wird und das rekombinante Polypeptid eine antigene env- Oder gag-HIV-l-Aminosaurese- 
quenz umfasst, die in der in Figur 2 gezeigten Sequenz enthalten ist, wobei das Polypeptid mit HTLV-I und HTLV-II 
immunologisch nicht kreuzreaktiv ist. 

17. Immunassay nach Anspruch 16, wobei mindestens eine env-Aminosauresequenz und eine gag-Aminosaurese- 
quenz zur Bindung der Antikorper verwendet werden. 

18. Diagnostisches Reagenz Oder Immunogen, fahig zur Bindung eines anti-HIV-l-Antikorpers in menschlichem Se- 
rum, dadurch gekennzeichnet, dass das Reagenz Oder Immunogen aus einem Antigen besteht, das ein immu- 
nogenes Fragment von mindestens sieben Aminosauren eines HiV-l-env- Oder -gag-Polypeptids umfasst, wobei 
das Fragment mit HTLV-I und HTLV-II immunologisch nicht kreuzreaktiv ist und eine in der in Figur 2 gezeigten 
Sequenz enthaltene Sequenz besitzt. 

19. Rekombinantes Polypeptid, dadurch gekennzeichnet, dass es von einer Zelle hergestellt wird, die mit einem 
rekombinanten DNA-Konstrukt nach Anspruch 5 transformiert ist. 

20. Rekombinantes Polypeptid, dadurch gekennzeichnet, dass es von einer Zelle hergestellt wird, die mit einem 
rekombinanten DNA-Konstrukt nach Anspruch 6 transformiert ist. 

21. Rekombinantes Polypeptid, dadurch gekennzeichnet, dass es von einer Zelle hergestellt wird, die mit einem 
rekombinanten DNA-Konstrukt nach Anspruch 7 transformiert ist. 

22. Rekombinantes Polypeptid nach Anspruch 21, wobei die gag-Aminosauresequenz p16-gag umfasst. 

23. Rekombinantes Polypeptid nach Anspruch 21, wobei die gag-Aminosauresequenz p25-gag umfasst. 

24. Rekombinantes Polypeptid nach Anspruch 21, wobei die gag-Aminosauresequenz ein Fusionsprotein aus 
p16-gag- und p25-gag-Aminosauresequenzen umfasst. 

25. Erzeugnis zur Verwendung in einem Immunassay fur HIV-l-Antikorper, dadurch gekennzeichnet, dass es einen 
festen Trager umfasst, an den ein rekombinantes Polypeptid nach Anspruch 19 gebunden ist. 

26. Erzeugnis zur Verwendung in einem Immunassay fur HIV-l-Antikorper, dadurch gekennzeichnet, dass es einen 
festen Trager umfasst, an den ein rekombinantes Polypeptid nach Anspruch 21 gebunden ist. 

27. DNA-Sequenz, die ein HIV-l-Polypeptid codiert, das von einem Phagen stammt, der ausgewahlt ist aus ARV-2 
(7D) (ATCC Nr. 40143) und ARV-2 (8A) (ATCC Nr. 40144). 

28. Rekombinantes DNA-Konstrukt, fahig zur Expression eines antigenen rekombinanten HIV-l-Polypeptids, das aus 
dem Organismus mit der ATCC Nr. 53246 stammt. 

29. Isoliertes Polynucleotid, das ein Fragment mit mindestens 21 bp der gag- oder env-Region der ARV-2-Sequenz 
aus Figur 2 umfasst, wobei das Polypeptid nicht groBer als 180 bp ist. 



Revendications 

1. Produit d'assemblage d'ADN recombinant, utile pour ('expression d'un polypeptide recombinant dans une cellule 
contenant le produit d'assemblage, le produit d'assemblage comprenant des sequences rggulatrices qui reglent 
la transcription et la traduction du polypeptide recombinant dans la cellule et une sequence codarite regutee par 
les sequences r6gulatrices, dans lequel la sequence codante comprend une sequence d'ADN d'au moins environ 
21 pb dans le cadre de lecture, caracterise en ce que la sequence d'ADN code pour une sequence antig6nique 
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d'acides amines gag ou env du VIH-I de la figure 2, ladite sequence ne presentant pas immunologiquement de 
reaction croisee avec le HTLV-I ni le HTLV-II et etant reactive avec le VIH-I. 

2. Prodult d'assemblage d'ADN recombinant selon la revendication 1 , utile pour I'expression dans une cellule euca- 
5 ryote. 

3. Produit d'assemblage d'ADN recombinant selon la revendication 1 , utile pour I'expression dans une cellule de 
levure. 

10 4. Produit d'assemblage d'ADN recombinant selon la revendication 1 , utile pour I'expression dans une cellule bac- 
terienne. 

5. Produit d'assemblage d'ADN recombinant selon Tune quelconque des revendications 1 a 4, caracterise en ce 
que la sequence d'ADN code pour une sequence d'acides amines provenant d'un polypeptide env du VIH-I. 

15 

6. Produit d'assemblage d'ADN recombinant selon la revendication 5, dans lequel la sequence d'ADN code pour un 
polypeptide env complet. 

7. Produit d'assemblage d'ADN recombinant selon I'une quelconque des revendications 1 a 4, caracterise en ce 
20 que la sequence d'ADN code pour une sequence d'acides amines provenant d'un polypeptide gag du VIH-I. 

8. Produit d'assemblage d'ADN recombinant selon la revendication 7, dans lequel la sequence d'ADN code pour un 
polypeptide gag complet. 

25 g. Cellule comprenant un produit d'assemblage d'ADN recombinant selon I'une quelconque des revendications 1 a 
8, la cellule exprimant la sequence antigenique d'acides amines du VIH-I et etant exempte d'autres cellules n'ex- 
primant pas la sequence antigenique d'acides amines du VIH-I. 

10. Cellule selon la revendication 9, le produit d'assemblage d'ADN recombinant comprenant un systeme de replication 
30 reconnu par la cellule. 

11. Cellule selon la revendication 10, la cellule etant un eucaryote. 

12. Cellule selon I'une quelconque des revendications 9 a 11 , qui est une levure. 

35 

13. Procede pour la production d'un polypeptide recombinant comprenant une sequence antigenique d'acides amines 
du VIH-I, dans lequel on cultive une population de cellules selon la revendication 9, dans des conditions dans 
lesquelles le polypeptide recombinant est exprime. 

40 14. Proc6d6 selon la revendication 13, dans lequel les cellules sont des eucaryotes. 

15. Procede selon la revendication 13, dans lequel les cellules sont des bacteries ou des levures. 

16. Analyse immunologique pour la detection d'anticorps dirig£s contre le VIH-I dans un echantillon presume contenir 
45 les anticorps, caracteris£e en ce qu'au moins un polypeptide recombinant est utilise pour fixer les anticorps et 

le polypeptide recombinant comprend une sequence antigenique d'acides amines env ou gag du VIH-I contenue 
dans la sequence representee a la figure 2, ce polypeptide ne presentant immunologiquement pas de reaction 
croisee avec le HTLV-I ni le HTLV-II. 

so 17. Analyse immunologique selon la revendication 1 6, dans laquelle au moins une sequence d'acides amines env et 
une sequence d'acides amines gag sont utilis6es pour fixer les anticorps. 

1 8. R6actif de diagnostic ou immunogene capable de fixer un anticorps asti-VIH-l dans du serum humain, caracterise 
en ce que ledit r6actif ou ledit immunogene consiste en un antigfcne comprenant un fragment immunogene d'au 
55 moins sept acides amines d'un polypeptide env ou gag du VIH-I, ce fragment ne presentant immunologiquement 

pas de reaction croisee avec le HTLV-I ni le HTLV-II et ayant une sequence contenue dans la sequence representee 
a la figure 2. 
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19. Polypeptide recombinant, caracterise en ce qu'il est produit par une cellule transformee par un produit d'assem- 
blage d'ADN recombinant selon la revendication 5. 

20. Polypeptide recombinant, caracterise en ce qu'il est produit par une cellule transformee par un produit d'assem- 
5 blage d'ADN recombinant selon la revendication 6. 

21. Polypeptide recombinant, caracterise en ce qu'il est produit par une cellule transformee par un produit d'assem- 
blage d'ADN recombinant selon la revendication 7. 

w 22. Polypeptide recombinant selon la revendication 21 , dans lequel la sequence d'acides amines gag comprend gag- 
p16. 

23. Polypeptide recombinant selon la revendication 21 , dans lequel la sequence d'acides amines gag comprend gag- 
p25. 

15 

24. Polypeptide recombinant selon la revendication 21 , dans lequel la sequence d'acides amines gag comprend une 
proteine de fusion de sequences d'acides amines gag-p16 et gag-p25. 

25. Article de fabrication pour utilisation dans une analyse immunologique pour des anticorps diriges contre le VIH-I, 
20 caracterise en ce qu'il comprend un support solide sur lequel est fix6 un polypeptide recombinant selon la re- 
vendication 19. 

26. Article de fabrication pour utilisation dans une analyse immuriologique pour des anticorps diriges contre le VIH-I, 
caracterise en ce qu'il comprend un support solide sur lequel est fix6 un polypeptide recombinant selon la re- 

25 vendication 21. 

27. Sequence d'ADN codant pour un polypeptide de VIH-I derive d'un phage choisi parmi ARV-2(7D) (ATCC n°401 43) - 
et ARV-2(8A) (ATCC n°40144). 

30 28. Produit d'assemblage d'ADN recombinant capable d'exprimer un polypeptide antig&iique recombinant du VIH-I 
d6riv6 de I'organisme ATCC n°53246. 

29. Polynucleotide isole comprenant un fragment d'au moins 21 pb de la region gag ou env de la sequence ARV-2 de 
la figure 2, ledit polynucleotide n'ayant pas plus de 180 pb. 

35 
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<U CT464A(CGCT4(tT1t«TCCCAAACAAGACAAfcAt*tClUtATC I&1tt*U l*tC«C»HCkJtuCC!ACnttCa*HtCC*t*MItCkCACC*6tCCCAWt*IC*tt)JHCCA 

111 CT44CCMHU14CT4CUtAAl( r»ftUCCA«lt«UCC>4&U*urMMtacS(e*i>6MUtU6MCA«Cii(THflA£ACCCTAtCA«CC1«C*1MMfUMUCCCC L 

}|« «*i**«AAtTSM»trtTtt*«!fl6AC»tC***CI*CCiT1»C*lt*C*lMCCCfilt»CCUC»TCCCtiC7*CI*CAl*CitCtCCtfc*tllTCS«6CITTCIlC*#6«*C1UCC6 T 

-tj Ct4C«ltTT>Ct*«t*KCCI*tCCTMtCM<.A<U6«ACTMC«TCtC?CAC*T6CT«c»Ji^«A«TCCHITIttCTtr«t6 IcTCTtKIWrUSAttACATCTUt R 

l> CCKt6*6CK»Ct«CT*»CU6tC»*CCCfcCieCTU»ttCTC*»U*AtCTIttCUfrA<.T6Cn(« StMtlfiTCTKCCtlCIHUUTCACUTfiCmciAtASAtCCCU* 

Ml WCCtHTUt!CMTtTtMAW*ICTC1*«« T46C6CCC6tlC»Ctt» CCCtl<AfctAttAtIAC#4CCAClCCtCCll!tf TCCIf criCCAt ICCCCTIt^ltAAtCtttLAtAC 

l|TSt1»Ar«&1»ttctf)yA1«*r 9 AUS«r»«Ht«S«rC»yB1yt1»U«A»plflU»ft1« tl 

tit CAAGAGtCGA4G44C44C4AClMtB*G1ACGCCAATTmBACtAICGGAG*CtASAA6«^ 

... uiMaa^^ 

lyi€1«A1nt«fiULfftM«6UtI a CUfiUAt«L7iitrliUj«LyltU(1nCUAUAIiAUAUAUCl r TlirG1fAiiiS0rI«rEl»*«Uar&lRAtMTirk*IUMl Ml 
*JI AAUAAKCtnACASAAU1AKA««AA6AUAAAACAAAAKrAAfiAAA«AtKCACACCAAACAKAectSCACCT6fiCICAfi(IU«CACCAGCCA(&TCASCCAAAATTACCCTATAKK 



«UA|Bl««KUfiljC1«H»tr*1iiMSUAl>l I ♦ WrM.lrfl FhrU-All* AI *I»-p»» U/» r*>»*l SU61 tilrUI iPb.S.rtrotl 11 .fro»«t »h»JtrAUltv lit 
lit CUAACC1ACACCtKCAAAT6C1ACA1CAUCtAtATCACCTA«*At1TTAAAT6CklCKlAAAA&TASUCAAUAAAKC1YUAKCCAWA«lAA1ACCCAt&TTTlCA«U1fA 

ltrtUtJy*Ut*fF.'etl»**i»L.-A>»1lif«#ll««Ai.T>r»»l61ytl,HlitlBAUA1*l««l6U»l»tLt«Lr«6'«TtirlUAii«6U6WA1 t A1j6l.TrpA|pAr|t«l 121 
• II fCAA\AAtCACCCACCCCACAA4A1T1 AAACACCAf KClAAACACA&TCSCS(CACATCAASCAfiCCATfiCAAATStTAAAAftA6AC1AtCAATCA46AASCT6CA6AATCtfiATA4A(TC 

nurfolHnttAUci^Pro M»»HPf«t)r«»fi«*itr f ci«>r>e*f S «l 7 s»rAipiltAi»tl y liirThrHrTiiri««ei*t1»€UlltCti1rpA«iTfif Ai*AtftVroPro 161 
Ml CATCC A4T4CA TGCACCECC T A TT AC ACC AC4CC A A AT 4A4AAAACCAA4C44A A4T4AC AT ACCAGGAACTAC TAST ACCCT TCAG4AACAAATA44A 1 46 AT GACAAAT AATCC AtCT 

M.rror-lCi 7 tUt i»T 7 r L/1 »r fi 1f piiti l«L« M Cljt«ftA«ai y tlU«*tAroli«t1yrs«rrreTl>r&«rlin««A«pntArg«UCIrfreLj«fil«^r«MtApaA«p Ml 
U«l A^tCt AtTktC»6k»»TCTAl AAAAttT ttAl AAlttl ttA»Tl Jl*A1 AAAA1 Atl AAtAATftl AT AtttCl AtCAtCAl TCI ttACfcl AAtACAAttAttAA*t6A>CttT1TAt*lA1 

^'••'••pArt^»tVrt f iT*fi..Ar|A1 1 6l-tUAlAl,ftt B A.#f.»i r »Aii,trp>l#tTRr6UTrirlt»Ltg»»1«1oA>«A1«A»»ProAipC7tl7t1lir|ltmil/» 341 
117B TAT4TAGACC4&T t C 7 AT AAA AC Tt 1 AAtAtCCfcAACAACC TTC ACAC4AT4T AAAAAAT 1 66AT4 ACAtAAACC TT6TTC6TCCAAAAT6CAAACCC ACATTfcl AAX AC T ATTTT AAAA 

ANl»«Ct 7 fr C AUAl ,Thr L .,cUCW»*t»«tT«.rAliCytC»B6>f««UI,61jPro4lytl4ll»iAU*r9»«U«BAl46»»Al«l» t tS»*'«lp«V«lThrA»«ProA1»A»» 3*1 
1341 CCAT MC4ACC A4C A4CTACACTA4-AA4AAAT4AT4ACA4C ATtTCAC44AGTGG4tCGACC CCCCCA1AAACCAA4AGTTTTGCC1GAAGCCATGA4CCAAGTAACAAATCCA4CTAAC 

ll«^tH«t«lnAr 0 sl,At«.rh,Ar v A> A Cl.Ar 9 Lytlhrl<HyiCjlfk«AA«CraS1jLr*fi>oK]7t4ttM*Ai*lf t*«flCr>ArpAl«ProArBLriLyft«>/C/*trpAr e 421 
MIB ATAATAAl EC AC ACA4ECAAT TTT A4CAACCAA AfiAAACA£TCT1AAClSlTlCAATTK1KfiCAAA£AA44^ACA1A(CCAAAAAT1CCAU4rCtCCT AGAAAAAASfitCI til GGABA 

CrlCl7Ar|U»Gi7MitlJi«tUiiAi»C7iIP^GIi«Ar V Gla«U^^ 4«l 

„ . MtPk«Ar9fiUK«pU«AUM*L««SU6l7L7AAUAP9SlBM»S*rl«rB1afiUTkrApaAU 13 

l&ll tBTBCAAGGUABGACACCAAATtAAACAtTCCACTGAGAGACAGAXt^^ 

ThrAll»raMafilaC1al«rtMAr9*l*fil7X1«61al/ft1»^^ Ml 
A«»J«^fr 0 ThrAr 9 Ar<6UL».6»i<»Al?rpST7Sl,CUA»«A,iiWrl««S«r«1.AI # 6l7AT«A»pArt*1*filxTlir» 1 l5«rPA«A»BP»«rrotU)l.T;hrt*BTrp4l« 13 
MOB ACAGtCCCACCAGAABAUGCTTCAGCmiBGBAGBAGAAJUCAACTCCCTCTCAm 

Ai8»roWfI«rCltCC 

tl7C1*"«Sl2C>*'htM*lrtV«UraCUTrrA«pG1ftll*na 143 
)*40 GSGGAATlBGABfiTTmTCAAAGUACACACTACUTCABAlACCTGTA^ 

Al«Li»L«BT> 1 rtUn.ClyC J1 T»rL..A»«^li«F^ol1 < Urfron««UT*rV4lFr«r > tl,«U,LftPr.Ctf*ilAi^1 r Pr < Lj»t»tLjtC1-T*- > »r 0 L.wThrCU U3 
ZOIB AUTGTUAXUABATTEBimiCTmAATUCCCCATTUTCCm 

tlvljsllalfiAULtvVilCUIUt/iTarGlaMctSUljiGU^^ Ml 
Xlfl AAOAA1 AAAA &C ATT A6TA(A(ATATC1 AXAtAAAT44AAAA&4AAC&4AAAAT1 TCAAAA^TT444CCT9AUATCCATAt AATACTCCA4TArTTSC TATAAAt AAAAJLA4AXACTA 

^rL7ttrpAr f L7«L»«f<TA*pPh»ArtCl»L»«At a t,«ir 8 TkrS1»AipPb€Trp«)«V«ICIiil*«C1jlUPralllsPr*A1*ST f L««L/lL7lLr«Lrf S«r*jlTkrVAl 313 
2 JOB CT AAATBCA4AAAAC TACT A 4 AT T1CAGALAACTT AATAAAACAACTCAABACTICTCSCAABTTC AKTTAC4AATACCA£ACCCC4XAfi4£TTAAAAAASAAAAAATCACT AACAGTAT 

L*iiAipfiUl7AtpAlAl7rPA«S«rV a 1ProL»«AtpL7tAtpPWAril7il7rTArAUFl»rTl>rlUrr»Sirt1«AinAtttG)«1l>rPrtfilf lUArfTprll RTvpAiafAl 303 
3471 t44AT4TtRST6A14CATACTTTTCAGTT«CTTABATAAA4ACinA4AAA4TATACTICATTTACtATACCTA4TATAAACAATGAGACACCAG6WTUGATATCAGTACAAT4TK 

L t"*r°S 1 S 61 'IlP*-'«S t 3 , »* r|tr ***4l 1 ■»«t>S»rll«cTliny» 1 t«|.MSt w^roPRalkrvTLrvCl ■AsaPraAmii H«V«* II «T»r4B1 af 3fr«»t ll«M«»t»«Y»r 343 

3141 TGCCACAttSATCGAAAGCATCACCAtrJUTATTCCAAAGTA^AT^ 

Hpl TAGGATCTGAClU(AAATACGCC4«CATI«AACAAAAAT«Bft^ 

*^**l{VrlUl«««tilp^ „, 

tin 66AT4CCTTAT4AACTCCATCCTUTAAATG4ACAGTACAGXCTATAATGCTSCCAGAAAAAGACACC TM4C1STCAATGACATACACAUTTACTG4CAAAATTG^T1G4GCAA4TC 

tWn»T7^»«Cljl'tL/»»«»L7*4UL««C7AL7»l««l*»Ar|t»7T»fi7lAl*L»«Tl»f4)«»A! n«PfOL.-rhrCI W t).Al*«l.L«.KT W V««AUCUA».Ar,C1» 443 
till A4AT1TATGC AGGGATTAAAC TAAA4C ACTTATCTAAAC I Ct TU4A44AACCA AAGCAC 1 f »f ft fTf THffM't fftGIGr TA 4AAC TGCC ABA AAACA444A4A 

3021 nCTAAMGMCtAGTAIATGAACTATAfrATSACCCATtAAJ^CTTAITABCAGAJUTAiAGAJlBCAB6B« 

.... ^•tti!KIrtttW!i*HK , *;t^^ Ml 

3141 TGAAAA£4G6AAAGTATB£AABGAT6A4>ttGTGCCXA£U 

..... **?i{"T:!r:il!S!!itt5!T^ it3 

1311 TTAAACFkCCCATACAAAA*MAA£ATBG4MGCATMT6<A?G*ASTAT14KAA« 

B isi!£tt2K!Sl!&^ 703 

III I AACCABATAAGABTBAATCABAGTTACTCACttaAJTAAtAxAfcf AtTf aaTaa>«i ACf««»Afc^Tf r«f fifcfcr *Te^ei jryayf *y IXAAAMAATTB^ABBAAATBAACAAIf AB 

3141 MAAAMIGTCABTWTBBAITMtJGAAMTAtTATTlT 

•raPre«aMaM>alrlG)allatalAtaS«K7»A«pl»it>sGtaLaaL7t^^ 
3144 UttTGTMTAGCAAAABAAATAGTAttCAKTBItAT^ 

ttall7ljtllaIltl»af4lAlatatmitalAtaS«rC1j1*rU^ „, 
SIM AAGBAJUUUTTATCCtGGUGOGTTUUIWCABT^ 

FIG. 2-1 
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M •'•••» lfiT>f n»«l»t»rAiMl"«l/»tr»««rMTirrUrT*rf*rl»HriAJi*l«C/»TrrTf>4U61,ntLjUU«Uflitei7H»»r»1/r*i»*r.«Ul*rlJ. Ml 
AIM CA|TAAAlACAATA<ATA{AiACAAltt(AA£AATT1CAtCACTACTACSSTTAAMCCKCttTTfilT9CCCAACAAT(AA<CAtAAAiriACC*f TCCCIACAATCCCCAMfittAAA 

• ll»«»fiUl«U»NlA»»Ali«l.t*#lMtf«l»tIllClyllM«lAMAM6>«AUBU«IUt»lM»»r*l»f«ISU»»*Alt»«tf»#l1tA«»A*«flitly»A/fl/» tOS 
Iff! AACIAITAAAATC tATUAlAATt^TTAAMAAAATtATAAAACASiTAAAAtJIICAMCTtAACACCTTAAAACAKAS f ACAAATWCACTATf CATCCACAJkTf TTAAAAA4AAM 

•S4A «MMATT&tMAAtACMTtCAMAMMGAATAGtACAMYMrA*t4AMUCAlAUMCTMA6MC^^ 

*!> 4 *; i ' , *M>r«U*Ycnr«i»iA>»*»Aim«M^ Ml 
4**1 «AACAMMVCCCCmMUAMACU«CMAKfTCTCTf*mW 

ArvAt^pfil/lyill mt AtAtlf C/»¥a I AlaSarArrM Ml»GI a As a All 

AIM. t6tATtAT(UMACAMT»<AMYMTUYTtTmtt 

«>M T IT 1 A YA G AC AY C AC Y AY CA AMY AC TC A 7 CC AA GMT AMT TCACAAG T AC AC ATCCCCC TAGG KGATGC T AA * TTM T AAT AAC AACA T A T TGGGG T C T CC AT ACACtAAAAAGAGAJ 

AIM TtttATTYMlCCAtW£TCCCeAlttU1l«A**Aj»^^ 

Oil ACTATAAAAAATACCATATTAAUlllACACTIAGtCCTAUTSTtJUTAtCAJ^CAASACAf AACAACCTACSA1CTCTACAATACTTC4CAC lACCACCATTAATAACACCAAAJtfU 

10*1 *CAAA6tCACmiSCCTAGT«TTMGAJUC«ACACAGMTAMTG«AACUGCCCCAMA«ACCMW^ 

SIM TTAACAMMACei«TTA«AC*TTTTCCTAGGCCATGCCTCCATA«TTAGMCAAm^ 

AMI AACieUUTMUCATmACAATIMCtCJCAiUAtA&eAMATMSCATTATTCAMA 

MIA GGAi CIC AGCCTAGCACTtCf 1 1 TAAC AaI TtC T A1 TGTAAAAACTGT TGCf TTCATTGC TACCCGTCTTTCACAACAA AACCCT1AGCCA TCTCCTATCCCAGCAAGAAGCCCACACM 

MM CCAC A^aCACC TCC TCAAAACAC^CICfcCICATCAKKC TICTCTATCAMACA&TAACIACrAAATfiTMTCXAATCTf TACAAATATTACCJtJ TASTATCATTM1ACTAATACC1JIA 

CtilyBLyMUlrtTftrttlAUIttUrtfAl 11 
MM ATACC AATACT TCTcTCCACC AT4CTAC TCATACAAT AT AGCAAAATATTAACAC AAACAAAATAMCACATTMITfiATAtAATAAEACAAAAAtCACAAGACAGYGCCAATCAAACIC 

l/t&>/ T»<-»r fl »r 9 »i.ljrrSl»»ii»L«i,»rpArgTr»ei y Tlirt««l* W lr«Al»««lL»»llttII«Cj iSar A I a Tl^ (1 «l yllauT r»*a Hh' ta Uyr T,rBl »V« t PraTa* t| 
4TM AACGGGACLAGGACCAA1 1 ATt*GCAtTYfcTGGAtA7 GGtECACCTTUYCCTYMMYGYlMTGAYCYGYACTGCYACAGAAAMYTCTGCSTtACAGYTlAf 1 ATGCAGYACCTGTi 

UptyttUAUTfcrlhf T^rl L ^l«•»♦tytA^•V•cAiplUAr$Al•^7rA»pT^^6Ut•^♦lt»A^«^ltlTfpAUX»lr^lt^AUCJ4»*^»^oT»^A^^»roAl•»retl■6^«Y•^ 91 
AIM ICCAAi6**CCiACTACCACTCTATTTUTtCiTCAKATtCT»tACCAUT6ArACAtAt«TACATAATSTHC«CCACAC»I6CCrtTtIACCCACASACCCCAACCCACAA6AJfiTA 

nH^.tl/AiPlf.lIhrCI.Ai.f i»Aiiil»«tTr»L,i*,«At.«»lt*l«l.tl»».tCt«CUAlpl»»n*i*rL«.TrpA»pCl«S«fl».ljl»r»Cy|ttU J lLt»l»>rfr« 1)1 
40M CTAlTCGCAAAlATmACAAAAUtYAACATSTMAAAAATAACArCCTACMtA&ATSCASCAMATATAATCAfiTYYkTCSBAVCAAACCCVAAASCCAlSfCTAAAATYAACCCCA 

Lt.C/tT»lT^t .u»i«C/i1hrA.pL«»6l 7 lr*Al .T^rA^TlirO^WrWrAi-TrpLjttUCl .l»*lrt6»f5UlUtitAl«Cj|J*rrp«A»«H»TfcrtlirWrllt HI 

• 1*1 CU1G16tYAtmAAAMCMCmT1TftGGGAAGfctTUTILATttCM1M7AGlAATYGMAAGMttMTAUA&MM 

*tB*«oLytUtSt*L7%tUA*»*HL«M»i>«Ar t Aj.L»«Aipt«| I «t fr* I l«AieAt«AY«t«rThrTkr Tkr As*TjrYHrAi«T»rftr«L«i HaDI iCyt AtRAr«SiM«1 (II 
AIM 'CAfcAT AACATTCAAAAASAAAATCCACtYT TTCGtAACCTTGATCf At TACCAATACATAATGCYMTACTACTACCAACTAIACCAAC TATAGGTYGAt ACA7TCTAAC AG*TCA£TC 

ltrtnrS)«AltC7i»rpi,tt«titrrhrC1«Pro|U»r«il#*l|T,rC7lIl>r»i>ol1*t1irAtA<*ll»L»«L/*Cf • A*aA«aLaiYkrF»aA*aCt*l.r»GI,*raC/«T*r Ml 
41BI AITACACACKCTftUCAAA66lATC*TlIUACCAAT!C£CA|ACATIATTtTACCCC66CI6«lttUCWrTCIAAACIBTAArAATOAACSTTCMT6«AAAA66ACC*1STAtA 

At e »«1S«rTSrtiltUC,tT»f»«1 »Ct y U»kr ? Pr 0 | lt»*TWMhrC)»L. -L» 0 t. W Al.tl7J.rL«.AI«Cl.tl«C1 11 •Ar,J«rA»pAi«Ph.T»rAlMo Ml 

ASM AAT6TC AGCA£A&1 AC AATGT AC AC Af GEAA1 TACCCCAAYAGTGTCAACYCAAtTGCTGTlAMICGCAGTCf ACXAULGAAGAGGTAGTAATf AMYCYGACAAYTTtACUAtUI 

tUlyxTfrU«n.TiUUL««Aint»witr»«1AUntAt«Cf\tfcrAr|»r«A 10 »i«Ai«T*f Ar|L,ti«r n*lyrlUt1y»r«CljArtAl |flt«( «T«rTkrGl » Ari 121 
•Ml tCTAAAACCATAA7AtTACACCTMATCAA1CVCYA«CAATYAMTClACAAAUCCCAJCAACUYACMfiMAAA 

IU I ltC'/*lpn»»r S LrHl.H,»C 7 ,*.«IUi.rAr|AUtl B Tr r A».*».Tkrl,,6UCl-I>tr«IL/lLf»L«.Ar l tl,C>-fMS)/A t «Al.L / lThrn«**T>M 111 
(Ml ATAATACMBATATAACAAAACCACAT ttTAAC ATTA£TA{ ACCACA1T&GAATAAC ACTT f ACAACACATAGTTAAAAAATT AAG-ACAACACT TTCGGA ATAATAJAACAATAt TCTTT 

AAMtMff'MrGJjGlMAlMAfilalUMIIIMM^ «11 

• AAA AATCAATCCKAGGAGGCGACtCAGMAI YtTAATtCACMYTTYAAlYGIAGAGGCCAAYTTYTCYACYGYAAl ACAAtACMCTETTYAA1AATA(ATGGAttTTAAATCACACYGAA 

GlfYiw l/t(\ y Ai«AioTArlt<ntL«a»roCi%Ar«[l«L f tGUtl«|l«A««MtT<pGUCU«tU\ r C|«AUIUtT7rAUF«-«frol1«Gly«lrGI«IU\trCr»l«t A«| 
AMI 66* AC 1 AA4CCAAAT6AC AC AATC ATAClCCCATfcTAGAATAAAACAAAf TAT AAAC AT4TCSC AMAAGTA£GAAAA£CJLATCTATUCCCTCCCATtUAGCACAAAT1AITT(TTCA 

StrAt* 1 1 aTh'GI/laulaaiaa T*rAra AipGT /G l/Th' Aialal TAfAtaAfpTarClaTa I *AtArc*roE I r CI rGI ?A| p1tat*rf AtpAl*Y rp>AralarG t via* Y/rlf i All 
)I0I ^CAAATAlTACACCGC'GCTATf AACAAGAGATGGTGGtACAAATGlAAC f AATGACACCMMICrtCAGACCTGSAGCACCAGATATGAMGACAJtTTCGACAACI&AATTATATAAA 

Tjrlyinl ] l»l/»IUtl»»rolt#«) / ll»AI«»roThrtjtAl*iyiAr t Ar|l*1»«lGI«ArfAI«L7AArtAl«»A»Il/l! t »«lGl f AlAll»irfc#L«tiS1l"Allt«6l» Ml 
)||» 7A1AAAG1AI1AA AAATT GAICC AT 1 AtGAAT A£C AtCC*£CAAGCC AAA&AMAIAGIMYM AMMGAAAMAMGC AGtGGGAAIMT AGGAMT ATGTYCCTTGGGTtCTTMGA 

Al t A l «C t rSa r T* r a* tf f A I iVa 1 1« r l« « 1 h r T» r Val C I «Al « ArfC 1 al.t«L«a}«rC t y ( 1 *t« ) C t nCI •£) *At • At«l*«L«aArfA1 1 M «GI wAlaCI MtafllA %»l 
DM GCAGCAGGAAGCACIATIGGCCCAGTOTCATrGACGCTGACCGYACACGCCAGACAAITATYGTCYGGTAYAG TCCAAC ACCAtAACAAT TTCC TCAGCCC TAT TGACACGC AACAA£1I 

IAAI CTGtTGCAACTCACAGKTGCGrtArCAAGCAGCrCCMCCAAMCrcCTGKTCVGGAMUfACC^ 

>»rT*rAlav.lP, 0 UpA».AlA*arlr,,SarA»«t. 7 iWl«aC>aAip t l«rrpAipA ta ii«iTArTr|>A«tGUTr a C1 «Ar«C)«| l«At»AMT/r TKrAiaTtr I If T 7 rlftr atl 
'»» ACCACtGCYGYStCYYGGAAYGtYMYYftMGY AATA^AICItYGGAAGACAY YIGG^IAAXATGACCTGGATGCAGTGGGAAAGAGAJIATYMCAAYY ACAXAAACACAATAT ACACC 

laalaaClaCla&arGlaAiaClaGlaClalitAi aGI aGt «Gl>L««u.a vCt al*«AtpL r t 1 rpAl ak«rt«a1rpAtaTrf r«alar ArA»aTrpU*TrvTiP t Uliil U Aft 
119* TlACTTGAAGAATCCCAGAACCAACAAMAMCAATSAACAAUIYTATTAMArTGMYAAtfMMMGYTTGTCCMTYCGTTTMCArAACMMYUCTCTCG 

f*t ll.»tt I l»»«l«1/tl/lii»»1t)yl#»Ar f Hatal*MAlaTall#»itrllaralAi«Ar|rilAr|ClaAlyT / rtarrr«la«S»f'»h»C»aTa«'Arala«»fAr*l»r» Ml 
JIM tHATAAYfiAlAGYACMGGCYTCGIAGCYrYAAMATACTfTIIKYGTCCrVrcTAYACYCMYAUGTYMMAGA^TACYCACCATrtTCArTYCAMtCCMCVCCCMYCCH 

Ar|6ljA>MiaAr,»rM«aCljll«C1t*ta*lrtl»M^ III 
'^^^^'CCCCAC AC(CCCKACK(AAICGAAiGAAGAACGlA>(A.lASACAGACAGAiGACALATCCGtYCGATlA&TGGATGGAT TCYTAGCAXYTATC YGGGAAGATCTGCGGAGCCTGt GC 

Ct.»ittS*M r rArgArpL«.Ar fi AtpL*wL«aL*iilUAlaAUAr|TArf4UUM«t.ta«lrHt«Ar«tljTrpGlaAt«U«Lyt1yrTr|YrtS«rLa«U«GtAt«r1rAnA t|| 
BOM C TC 1 T C AtC I ACCCtCGC 1 TCAGAGAfTTAC TCT YGATTGCAGCGACCACTGTCCAAATYCV CCGCCACACMCCTCG^ACCtCTCAAATATTAATcXAGrCTCCTCCACTAlTMAI I 

GUCt«laMljiA»*S-rAla»»>Sar Tr pL««A i*AI*TArA t a M»AI *«• 1 TftrCl aGI jr t»r AipArgrt 1 1 t*GI «Ti I Al aSlaAraA 1 al 7 rAr«AI a I lalaattl a 1 1 Ml » Ml 
8IM C AtGAACl AAAAAATACTCC tCT t AECTCGC fCAACGCCACACCtATAGCAGTAAC TGXCMGACAM7ACMTTATACAACTAMACAAACACC1 TA1ACACCTATTCTCCACATACAT 



AIM tGAA^YTAMCAMUlYGA^AAAGGCtYYIGCYATAAMYGGGYGM AMYMYGAAAMGYAGYAYGMlMATMTCTGtYAUAtUAAJLG^TtJWMMIGAMCtMAM 

AKI I G AGXC AG< AGCAGAtGGM YGGAAGC AG TA fC YCGAGACC TGGMAAAX ATGGAjGCMYCACMGYAGC AAT At AGCMC f AC TAA1GC YCAY TCI CtCMGC TAA\AAGCAXAAGA6M 

M«| GGAAG^GGtGMUTICCAATCAMCCtCWTACCryiAAMCCMTMCYIAtAAGUAMniAMrATTAMCMfllfYAAAACUAAGtGMM CY4GAAGCGCIAAYTTMT ■ 

Ail) tCCAAAWMUMMlCUYUKfCTaWrCMCCAC^ 

ITU TCAAGC TAAl ACC AIT ISAAC CAGAGAAGG 1 AGAAAAGGCCAATGAAG£A GAM ACAACAGCT TCTIACACCCIATAJLGCCTGCATGAMTGAVAMA^MCGAGAAAGAAGTGtTMYGT 

AI0) St*Ct J I TGACAGCAAAC TACCA TT TCATC ACA? CCCCC GACACCTCCA1 CCGGAA t AC t AC AAAGAC TGC TCACATCGACC T t TCTAXAAGGCAC 1 1 TCCCCTGAVG4ACTI TCCAMGA 

tOII CCCCrGGCtUGGCGGMCYGCGMCTG^CrceCVCAMYMYGCAYAYAAMMCTGCItllVMCtCYACYC GAvTcTCIC ICGTlAGAGCAGAIC fCACCCtCCCCCCfCTCfCCC 

1I4A 1AAC»AtMAAXCCACtCCtlAAMCTCAA2AAa6CtlGCCJICA«t«TICA MtACtCTCtGCCCCTCTGl T«TC TtAC ICTGCT AACIAGAGA ICCC TCAGACCC H T lACYCMf 
*A>H 

HA ft GTGGAAAAAUUTAGCAG 



FIG. 2-2 
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